ChatGPT in glioma adjuvant therapy decision making: ready to assume the role of a doctor in the tumour board?

Objective To evaluate ChatGPT‘s performance in brain glioma adjuvant therapy decision-making.Methods We randomly selected 10 patients with brain gliomas discussed at our institution’s central nervous system tumour board (CNS TB). Patients’ clinical status, surgical outcome, textual imaging informati...

Full description

Bibliographic Details
Main Authors: Karl Schaller, Philippe Bijlenga, Aria Nouri, Kristof Egervari, Shahan Momjian, Julien Haemmerli, Lukas Sveikata, Adrien May, Christian Freyschlag, Johannes A Lobrinus, Denis Migliorini, Nicolae Sanda, Sebastien Tran, Jacky Yeung
Format: Article
Language:English
Published: BMJ Publishing Group 2023-06-01
Series:BMJ Health & Care Informatics
Online Access:https://informatics.bmj.com/content/30/1/e100775.full
_version_ 1827153061763612672
author Karl Schaller
Philippe Bijlenga
Aria Nouri
Kristof Egervari
Shahan Momjian
Julien Haemmerli
Lukas Sveikata
Adrien May
Christian Freyschlag
Johannes A Lobrinus
Denis Migliorini
Nicolae Sanda
Sebastien Tran
Jacky Yeung
author_facet Karl Schaller
Philippe Bijlenga
Aria Nouri
Kristof Egervari
Shahan Momjian
Julien Haemmerli
Lukas Sveikata
Adrien May
Christian Freyschlag
Johannes A Lobrinus
Denis Migliorini
Nicolae Sanda
Sebastien Tran
Jacky Yeung
author_sort Karl Schaller
collection DOAJ
description Objective To evaluate ChatGPT‘s performance in brain glioma adjuvant therapy decision-making.Methods We randomly selected 10 patients with brain gliomas discussed at our institution’s central nervous system tumour board (CNS TB). Patients’ clinical status, surgical outcome, textual imaging information and immuno-pathology results were provided to ChatGPT V.3.5 and seven CNS tumour experts. The chatbot was asked to give the adjuvant treatment choice, and the regimen while considering the patient’s functional status. The experts rated the artificial intelligence-based recommendations from 0 (complete disagreement) to 10 (complete agreement). An intraclass correlation coefficient agreement (ICC) was used to measure the inter-rater agreement.Results Eight patients (80%) met the criteria for glioblastoma and two (20%) were low-grade gliomas. The experts rated the quality of ChatGPT recommendations as poor for diagnosis (median 3, IQR 1–7.8, ICC 0.9, 95% CI 0.7 to 1.0), good for treatment recommendation (7, IQR 6–8, ICC 0.8, 95% CI 0.4 to 0.9), good for therapy regimen (7, IQR 4–8, ICC 0.8, 95% CI 0.5 to 0.9), moderate for functional status consideration (6, IQR 1–7, ICC 0.7, 95% CI 0.3 to 0.9) and moderate for overall agreement with the recommendations (5, IQR 3–7, ICC 0.7, 95% CI 0.3 to 0.9). No differences were observed between the glioblastomas and low-grade glioma ratings.Conclusions ChatGPT performed poorly in classifying glioma types but was good for adjuvant treatment recommendations as evaluated by CNS TB experts. Even though the ChatGPT lacks the precision to replace expert opinion, it may serve as a promising supplemental tool within a human-in-the-loop approach.
first_indexed 2024-03-08T16:37:53Z
format Article
id doaj.art-1242d5649eb64f529d71806a749af804
institution Directory Open Access Journal
issn 2632-1009
language English
last_indexed 2025-03-20T22:15:47Z
publishDate 2023-06-01
publisher BMJ Publishing Group
record_format Article
series BMJ Health & Care Informatics
spelling doaj.art-1242d5649eb64f529d71806a749af8042024-08-09T01:15:10ZengBMJ Publishing GroupBMJ Health & Care Informatics2632-10092023-06-0130110.1136/bmjhci-2023-100775ChatGPT in glioma adjuvant therapy decision making: ready to assume the role of a doctor in the tumour board?Karl Schaller0Philippe Bijlenga1Aria Nouri2Kristof Egervari3Shahan Momjian4Julien Haemmerli5Lukas Sveikata6Adrien May7Christian Freyschlag8Johannes A Lobrinus9Denis Migliorini10Nicolae Sanda11Sebastien Tran12Jacky Yeung13Department of Clinical Neurosciences, Division of Neurosurgery, Geneva University Hospitals, Geneva, SwitzerlandDepartment of Clinical Neurosciences, Division of Neurosurgery, Geneva University Hospitals, Geneva, SwitzerlandDivision of Neurosurgery, Geneva University Hospitals, Geneva, SwitzerlandDepartment of Pathology and Immunology, Geneva University Hospitals, Geneva, SwitzerlandDepartment of Clinical Neurosciences, Division of Neurosurgery, Geneva University Hospitals, Geneva, SwitzerlandDepartment of Clinical Neurosciences, Division of Neurosurgery, Geneva University Hospitals, Geneva, SwitzerlandDepartment of Clinical Neurosciences, Division of Neurology, Geneva University Hospitals, Geneva, SwitzerlandDepartment of Clinical Neurosciences, Division of Neurosurgery, Geneva University Hospitals, Geneva, SwitzerlandDepartment of Neurosurgery, Medical University of Innsbruck, Innsbruck, AustriaDepartment of Pathology and Immunology, Geneva University Hospitals, Geneva, SwitzerlandDepartment of Oncology, Geneva University Hospitals, Geneva, SwitzerlandDepartment of Clinical Neurosciences, Division of Neurology, Geneva University Hospitals, Geneva, SwitzerlandDepartment of Radiation Oncology, Geneva University Hospitals, Geneva, SwitzerlandDepartment of Neurosurgery, Yale University School of Medicine, New Haven, Connecticut, USAObjective To evaluate ChatGPT‘s performance in brain glioma adjuvant therapy decision-making.Methods We randomly selected 10 patients with brain gliomas discussed at our institution’s central nervous system tumour board (CNS TB). Patients’ clinical status, surgical outcome, textual imaging information and immuno-pathology results were provided to ChatGPT V.3.5 and seven CNS tumour experts. The chatbot was asked to give the adjuvant treatment choice, and the regimen while considering the patient’s functional status. The experts rated the artificial intelligence-based recommendations from 0 (complete disagreement) to 10 (complete agreement). An intraclass correlation coefficient agreement (ICC) was used to measure the inter-rater agreement.Results Eight patients (80%) met the criteria for glioblastoma and two (20%) were low-grade gliomas. The experts rated the quality of ChatGPT recommendations as poor for diagnosis (median 3, IQR 1–7.8, ICC 0.9, 95% CI 0.7 to 1.0), good for treatment recommendation (7, IQR 6–8, ICC 0.8, 95% CI 0.4 to 0.9), good for therapy regimen (7, IQR 4–8, ICC 0.8, 95% CI 0.5 to 0.9), moderate for functional status consideration (6, IQR 1–7, ICC 0.7, 95% CI 0.3 to 0.9) and moderate for overall agreement with the recommendations (5, IQR 3–7, ICC 0.7, 95% CI 0.3 to 0.9). No differences were observed between the glioblastomas and low-grade glioma ratings.Conclusions ChatGPT performed poorly in classifying glioma types but was good for adjuvant treatment recommendations as evaluated by CNS TB experts. Even though the ChatGPT lacks the precision to replace expert opinion, it may serve as a promising supplemental tool within a human-in-the-loop approach.https://informatics.bmj.com/content/30/1/e100775.full
spellingShingle Karl Schaller
Philippe Bijlenga
Aria Nouri
Kristof Egervari
Shahan Momjian
Julien Haemmerli
Lukas Sveikata
Adrien May
Christian Freyschlag
Johannes A Lobrinus
Denis Migliorini
Nicolae Sanda
Sebastien Tran
Jacky Yeung
ChatGPT in glioma adjuvant therapy decision making: ready to assume the role of a doctor in the tumour board?
BMJ Health & Care Informatics
title ChatGPT in glioma adjuvant therapy decision making: ready to assume the role of a doctor in the tumour board?
title_full ChatGPT in glioma adjuvant therapy decision making: ready to assume the role of a doctor in the tumour board?
title_fullStr ChatGPT in glioma adjuvant therapy decision making: ready to assume the role of a doctor in the tumour board?
title_full_unstemmed ChatGPT in glioma adjuvant therapy decision making: ready to assume the role of a doctor in the tumour board?
title_short ChatGPT in glioma adjuvant therapy decision making: ready to assume the role of a doctor in the tumour board?
title_sort chatgpt in glioma adjuvant therapy decision making ready to assume the role of a doctor in the tumour board
url https://informatics.bmj.com/content/30/1/e100775.full
work_keys_str_mv AT karlschaller chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT philippebijlenga chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT arianouri chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT kristofegervari chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT shahanmomjian chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT julienhaemmerli chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT lukassveikata chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT adrienmay chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT christianfreyschlag chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT johannesalobrinus chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT denismigliorini chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT nicolaesanda chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT sebastientran chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard
AT jackyyeung chatgptingliomaadjuvanttherapydecisionmakingreadytoassumetheroleofadoctorinthetumourboard