Automated spinal MRI labelling from reports using a large language model

We propose a general pipeline to automate the extraction of labels from radiology reports using large language models, which we validate on spinal MRI reports. The efficacy of our method is measured on two distinct conditions: spinal cancer and stenosis. Using open-source models, our method surpasse...

Full description

Bibliographic Details
Main Authors: Park, RY, Windsor, R, Jamaludin, A, Zisserman, A
Format: Conference item
Language:English
Published: Springer 2024
_version_ 1817931414948020224
author Park, RY
Windsor, R
Jamaludin, A
Zisserman, A
author_facet Park, RY
Windsor, R
Jamaludin, A
Zisserman, A
author_sort Park, RY
collection OXFORD
description We propose a general pipeline to automate the extraction of labels from radiology reports using large language models, which we validate on spinal MRI reports. The efficacy of our method is measured on two distinct conditions: spinal cancer and stenosis. Using open-source models, our method surpasses GPT-4 on a held-out set of reports. Furthermore, we show that the extracted labels can be used to train an imaging model to classify the identified conditions in the accompanying MR scans. Both the cancer and stenosis classifiers trained using automated labels achieve comparable performance to models trained using scans manually annotated by clinicians. Code can be found at https://github.com/robinyjpark/AutoLabelClassifier.
first_indexed 2024-12-09T03:21:39Z
format Conference item
id oxford-uuid:13d403eb-ec9f-4701-9b7d-a7e9a5a6d75d
institution University of Oxford
language English
last_indexed 2024-12-09T03:21:39Z
publishDate 2024
publisher Springer
record_format dspace
spelling oxford-uuid:13d403eb-ec9f-4701-9b7d-a7e9a5a6d75d2024-11-19T13:30:15ZAutomated spinal MRI labelling from reports using a large language modelConference itemhttp://purl.org/coar/resource_type/c_5794uuid:13d403eb-ec9f-4701-9b7d-a7e9a5a6d75dEnglishSymplectic ElementsSpringer2024Park, RYWindsor, RJamaludin, AZisserman, AWe propose a general pipeline to automate the extraction of labels from radiology reports using large language models, which we validate on spinal MRI reports. The efficacy of our method is measured on two distinct conditions: spinal cancer and stenosis. Using open-source models, our method surpasses GPT-4 on a held-out set of reports. Furthermore, we show that the extracted labels can be used to train an imaging model to classify the identified conditions in the accompanying MR scans. Both the cancer and stenosis classifiers trained using automated labels achieve comparable performance to models trained using scans manually annotated by clinicians. Code can be found at https://github.com/robinyjpark/AutoLabelClassifier.
spellingShingle Park, RY
Windsor, R
Jamaludin, A
Zisserman, A
Automated spinal MRI labelling from reports using a large language model
title Automated spinal MRI labelling from reports using a large language model
title_full Automated spinal MRI labelling from reports using a large language model
title_fullStr Automated spinal MRI labelling from reports using a large language model
title_full_unstemmed Automated spinal MRI labelling from reports using a large language model
title_short Automated spinal MRI labelling from reports using a large language model
title_sort automated spinal mri labelling from reports using a large language model
work_keys_str_mv AT parkry automatedspinalmrilabellingfromreportsusingalargelanguagemodel
AT windsorr automatedspinalmrilabellingfromreportsusingalargelanguagemodel
AT jamaludina automatedspinalmrilabellingfromreportsusingalargelanguagemodel
AT zissermana automatedspinalmrilabellingfromreportsusingalargelanguagemodel