Automated spinal MRI labelling from reports using a large language model
We propose a general pipeline to automate the extraction of labels from radiology reports using large language models, which we validate on spinal MRI reports. The efficacy of our method is measured on two distinct conditions: spinal cancer and stenosis. Using open-source models, our method surpasse...
Main Authors: | , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
Springer
2024
|
_version_ | 1817931414948020224 |
---|---|
author | Park, RY Windsor, R Jamaludin, A Zisserman, A |
author_facet | Park, RY Windsor, R Jamaludin, A Zisserman, A |
author_sort | Park, RY |
collection | OXFORD |
description | We propose a general pipeline to automate the extraction of labels from radiology reports using large language models, which we validate on spinal MRI reports. The efficacy of our method is measured on two distinct conditions: spinal cancer and stenosis. Using open-source models, our method surpasses GPT-4 on a held-out set of reports. Furthermore, we show that the extracted labels can be used to train an imaging model to classify the identified conditions in the accompanying MR scans. Both the cancer and stenosis classifiers trained using automated labels achieve comparable performance to models trained using scans manually annotated by clinicians. Code can be found at https://github.com/robinyjpark/AutoLabelClassifier. |
first_indexed | 2024-12-09T03:21:39Z |
format | Conference item |
id | oxford-uuid:13d403eb-ec9f-4701-9b7d-a7e9a5a6d75d |
institution | University of Oxford |
language | English |
last_indexed | 2024-12-09T03:21:39Z |
publishDate | 2024 |
publisher | Springer |
record_format | dspace |
spelling | oxford-uuid:13d403eb-ec9f-4701-9b7d-a7e9a5a6d75d2024-11-19T13:30:15ZAutomated spinal MRI labelling from reports using a large language modelConference itemhttp://purl.org/coar/resource_type/c_5794uuid:13d403eb-ec9f-4701-9b7d-a7e9a5a6d75dEnglishSymplectic ElementsSpringer2024Park, RYWindsor, RJamaludin, AZisserman, AWe propose a general pipeline to automate the extraction of labels from radiology reports using large language models, which we validate on spinal MRI reports. The efficacy of our method is measured on two distinct conditions: spinal cancer and stenosis. Using open-source models, our method surpasses GPT-4 on a held-out set of reports. Furthermore, we show that the extracted labels can be used to train an imaging model to classify the identified conditions in the accompanying MR scans. Both the cancer and stenosis classifiers trained using automated labels achieve comparable performance to models trained using scans manually annotated by clinicians. Code can be found at https://github.com/robinyjpark/AutoLabelClassifier. |
spellingShingle | Park, RY Windsor, R Jamaludin, A Zisserman, A Automated spinal MRI labelling from reports using a large language model |
title | Automated spinal MRI labelling from reports using a large language model |
title_full | Automated spinal MRI labelling from reports using a large language model |
title_fullStr | Automated spinal MRI labelling from reports using a large language model |
title_full_unstemmed | Automated spinal MRI labelling from reports using a large language model |
title_short | Automated spinal MRI labelling from reports using a large language model |
title_sort | automated spinal mri labelling from reports using a large language model |
work_keys_str_mv | AT parkry automatedspinalmrilabellingfromreportsusingalargelanguagemodel AT windsorr automatedspinalmrilabellingfromreportsusingalargelanguagemodel AT jamaludina automatedspinalmrilabellingfromreportsusingalargelanguagemodel AT zissermana automatedspinalmrilabellingfromreportsusingalargelanguagemodel |