Abstract #1763

Pretraining using masked language modeling improves label noise robustness on the metadata standardization task

Ben A Duffy¹ and Ryan Chamberlain¹

¹Subtle Medical Inc., Menlo Park, CA, United States

Synopsis

Keywords: Data Processing, Software Tools

Motivation: The lack of standardization in MRI metadata increases radiologist workload.

Goal(s): To demonstrate an approach to standardize the image contrast and the body part examined information in the DICOM header and to understand the benefits of self-supervised pretraining on the metadata standardization task in the presence of label noise.

Approach: Masked language modeling was used for pretraining. At the fine-tuning stage, a transformer model was used to predict the image contrast and body part from both the text and numerical DICOM tags.

Results: Pretraining improves robustness to label noise, with there being no loss in performance at 20% label noise.

Impact: Pretraining using masked language modeling is effective at rendering a metadata stanardization system robust to label noise. Such a system can be used to standardize MRI metadata and therefore reduce radiologist workload. Future work should investigate class conditional label noise.

How to access this content:

For one year after publication, abstracts and videos are only open to registrants of this annual meeting. Registrants should use their existing login information. Non-registrant access can be purchased via the ISMRM E-Library.

After one year, current ISMRM & ISMRT members get free access to both the abstracts and videos. Non-members and non-registrants must purchase access via the ISMRM E-Library.

After two years, the meeting proceedings (abstracts) are opened to the public and require no login information. Videos remain behind password for access by members, registrants and E-Library customers.

Click here for more information on becoming a member.