The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers

OBJECTIVES The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical context has been questioned, because it...

Full description

Bibliographic Details
Main Authors: Stefano Parodi, Damiano Verda, Francesca Bagnasco, Marco Muselli
Format: Article
Language:English
Published: Korean Society of Epidemiology 2022-10-01
Series:Epidemiology and Health
Subjects:
Online Access:http://www.e-epih.org/upload/epih-44-e2022088.pdf
_version_ 1797768377569116160
author Stefano Parodi
Damiano Verda
Francesca Bagnasco
Marco Muselli
author_facet Stefano Parodi
Damiano Verda
Francesca Bagnasco
Marco Muselli
author_sort Stefano Parodi
collection DOAJ
description OBJECTIVES The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical context has been questioned, because it does not seem to be directly related to the actual performance of a diagnostic marker in identifying diseased and non-diseased subjects in real clinical settings. This study evaluates the relationship between the AUC and the proportion of correct classifications (global diagnostic accuracy, GDA) in relation to the shape of the corresponding ROC curves. METHODS We demonstrate that AUC represents an upward-biased measure of GDA at an optimal accuracy cut-off for balanced groups. The magnitude of bias depends on the shape of the ROC plot and on the proportion of diseased and non-diseased subjects. In proper curves, the bias is independent from the diseased/non-diseased ratio and can be easily estimated and removed. Moreover, a comparison between 2 partial AUCs can be replaced by a more powerful test for the corresponding whole AUCs. RESULTS Applications to 3 real datasets are provided: a marker for a hormone deficit in children, 2 tumour markers for malignant mesothelioma, and 2 gene expression profiles in ovarian cancer patients. CONCLUSIONS The AUC is a measure of accuracy with potential clinical relevance for the evaluation of disease markers. The clinical meaning of ROC parameters should always be evaluated with an analysis of the shape of the corresponding ROC curve.
first_indexed 2024-03-12T20:53:37Z
format Article
id doaj.art-e82f12f8150049dca2ce179b4ed19bb5
institution Directory Open Access Journal
issn 2092-7193
language English
last_indexed 2024-03-12T20:53:37Z
publishDate 2022-10-01
publisher Korean Society of Epidemiology
record_format Article
series Epidemiology and Health
spelling doaj.art-e82f12f8150049dca2ce179b4ed19bb52023-08-01T00:41:57ZengKorean Society of EpidemiologyEpidemiology and Health2092-71932022-10-014410.4178/epih.e20220881336The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markersStefano ParodiDamiano VerdaFrancesca BagnascoMarco MuselliOBJECTIVES The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical context has been questioned, because it does not seem to be directly related to the actual performance of a diagnostic marker in identifying diseased and non-diseased subjects in real clinical settings. This study evaluates the relationship between the AUC and the proportion of correct classifications (global diagnostic accuracy, GDA) in relation to the shape of the corresponding ROC curves. METHODS We demonstrate that AUC represents an upward-biased measure of GDA at an optimal accuracy cut-off for balanced groups. The magnitude of bias depends on the shape of the ROC plot and on the proportion of diseased and non-diseased subjects. In proper curves, the bias is independent from the diseased/non-diseased ratio and can be easily estimated and removed. Moreover, a comparison between 2 partial AUCs can be replaced by a more powerful test for the corresponding whole AUCs. RESULTS Applications to 3 real datasets are provided: a marker for a hormone deficit in children, 2 tumour markers for malignant mesothelioma, and 2 gene expression profiles in ovarian cancer patients. CONCLUSIONS The AUC is a measure of accuracy with potential clinical relevance for the evaluation of disease markers. The clinical meaning of ROC parameters should always be evaluated with an analysis of the shape of the corresponding ROC curve.http://www.e-epih.org/upload/epih-44-e2022088.pdfreceiver operating characteristic curvebiomarkersdiagnostic accuracybinormal modelmetz and kronman test
spellingShingle Stefano Parodi
Damiano Verda
Francesca Bagnasco
Marco Muselli
The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
Epidemiology and Health
receiver operating characteristic curve
biomarkers
diagnostic accuracy
binormal model
metz and kronman test
title The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
title_full The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
title_fullStr The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
title_full_unstemmed The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
title_short The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
title_sort clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
topic receiver operating characteristic curve
biomarkers
diagnostic accuracy
binormal model
metz and kronman test
url http://www.e-epih.org/upload/epih-44-e2022088.pdf
work_keys_str_mv AT stefanoparodi theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT damianoverda theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT francescabagnasco theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT marcomuselli theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT stefanoparodi clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT damianoverda clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT francescabagnasco clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT marcomuselli clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers