Receiver Operating Characteristic Prediction for Classification: Performances in Cross-Validation by Example

The stability of receiver operating characteristic in context of random split used in development and validation sets, as compared to the full models for three inflammatory ratios (neutrophil-to-lymphocyte (NLR), derived neutrophil-to-lymphocyte (dNLR) and platelet-to-lymphocyte (PLR) ratio) evaluat...

Full description

Bibliographic Details
Main Authors: Andra Ciocan, Nadim Al Hajjar, Florin Graur, Valentin C. Oprea, Răzvan A. Ciocan, Sorana D. Bolboacă
Format: Article
Language:English
Published: MDPI AG 2020-10-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/8/10/1741
_version_ 1797551341220921344
author Andra Ciocan
Nadim Al Hajjar
Florin Graur
Valentin C. Oprea
Răzvan A. Ciocan
Sorana D. Bolboacă
author_facet Andra Ciocan
Nadim Al Hajjar
Florin Graur
Valentin C. Oprea
Răzvan A. Ciocan
Sorana D. Bolboacă
author_sort Andra Ciocan
collection DOAJ
description The stability of receiver operating characteristic in context of random split used in development and validation sets, as compared to the full models for three inflammatory ratios (neutrophil-to-lymphocyte (NLR), derived neutrophil-to-lymphocyte (dNLR) and platelet-to-lymphocyte (PLR) ratio) evaluated as predictors for metastasis in patients with colorectal cancer, was investigated. Data belonging to patients admitted with the diagnosis of colorectal cancer from January 2014 until September 2019 in a single hospital were used. There were 1688 patients eligible for the study, 418 in the metastatic stage. All investigated inflammatory ratios proved to be significant classification models on both the full models and on cross-validations (AUCs > 0.05). High variability of the cut-off values was observed in the unrestricted and restricted split (full models: 4.255 for NLR, 2.745 for dNLR and 255.56 for PLR; random splits: cut-off from 3.215 to 5.905 for NLR, from 2.625 to 3.575 for dNLR and from 134.67 to 335.9 for PLR), but with no effect on the models characteristics or performances. The investigated biomarkes proved limited value as predictors for metastasis (AUCs < 0.8), with largely sensitivity and specificity (from 33.3% to 79.2% for the full model and 29.1% to 82.7% in the restricted splits). Our results showed that a simple random split of observations, weighting or not the patients with and whithout metastasis, in a ROC analysis assures the performances similar to the full model, if at least 70% of the available population is included in the study.
first_indexed 2024-03-10T15:44:15Z
format Article
id doaj.art-49495e6828b04da781fa61d0705e2c2e
institution Directory Open Access Journal
issn 2227-7390
language English
last_indexed 2024-03-10T15:44:15Z
publishDate 2020-10-01
publisher MDPI AG
record_format Article
series Mathematics
spelling doaj.art-49495e6828b04da781fa61d0705e2c2e2023-11-20T16:35:54ZengMDPI AGMathematics2227-73902020-10-01810174110.3390/math8101741Receiver Operating Characteristic Prediction for Classification: Performances in Cross-Validation by ExampleAndra Ciocan0Nadim Al Hajjar1Florin Graur2Valentin C. Oprea3Răzvan A. Ciocan4Sorana D. Bolboacă5Department of Medical Informatics and Biostatistics, “Iuliu Hațieganu” University of Medicine and Pharmacy Cluj-Napoca, Louis Pasteur Street, No. 6, 400349 Cluj-Napoca, Romania“Prof. Dr. Octavian Fodor” Regional Institute of Gastroenterology and Hepatology Cluj-Napoca, Croitorilor Street, No. 19–21, 400162 Cluj-Napoca, Romania“Prof. Dr. Octavian Fodor” Regional Institute of Gastroenterology and Hepatology Cluj-Napoca, Croitorilor Street, No. 19–21, 400162 Cluj-Napoca, Romania“Dr. Constantin Papilian” Military Emergency Hospital Cluj-Napoca, General Traian Moșoiu Street, No. 22, 400132 Cluj-Napoca, RomaniaDepartment of Medical Skills—Human Sciences, “Iuliu Hațieganu” University of Medicine and Pharmacy Cluj-Napoca, Marinescu Street, No. 23, 400337 Cluj-Napoca, RomaniaDepartment of Medical Informatics and Biostatistics, “Iuliu Hațieganu” University of Medicine and Pharmacy Cluj-Napoca, Louis Pasteur Street, No. 6, 400349 Cluj-Napoca, RomaniaThe stability of receiver operating characteristic in context of random split used in development and validation sets, as compared to the full models for three inflammatory ratios (neutrophil-to-lymphocyte (NLR), derived neutrophil-to-lymphocyte (dNLR) and platelet-to-lymphocyte (PLR) ratio) evaluated as predictors for metastasis in patients with colorectal cancer, was investigated. Data belonging to patients admitted with the diagnosis of colorectal cancer from January 2014 until September 2019 in a single hospital were used. There were 1688 patients eligible for the study, 418 in the metastatic stage. All investigated inflammatory ratios proved to be significant classification models on both the full models and on cross-validations (AUCs > 0.05). High variability of the cut-off values was observed in the unrestricted and restricted split (full models: 4.255 for NLR, 2.745 for dNLR and 255.56 for PLR; random splits: cut-off from 3.215 to 5.905 for NLR, from 2.625 to 3.575 for dNLR and from 134.67 to 335.9 for PLR), but with no effect on the models characteristics or performances. The investigated biomarkes proved limited value as predictors for metastasis (AUCs < 0.8), with largely sensitivity and specificity (from 33.3% to 79.2% for the full model and 29.1% to 82.7% in the restricted splits). Our results showed that a simple random split of observations, weighting or not the patients with and whithout metastasis, in a ROC analysis assures the performances similar to the full model, if at least 70% of the available population is included in the study.https://www.mdpi.com/2227-7390/8/10/1741receiver operating characteristicarea under the curvemodels performancesbiomarkersmetastasis
spellingShingle Andra Ciocan
Nadim Al Hajjar
Florin Graur
Valentin C. Oprea
Răzvan A. Ciocan
Sorana D. Bolboacă
Receiver Operating Characteristic Prediction for Classification: Performances in Cross-Validation by Example
Mathematics
receiver operating characteristic
area under the curve
models performances
biomarkers
metastasis
title Receiver Operating Characteristic Prediction for Classification: Performances in Cross-Validation by Example
title_full Receiver Operating Characteristic Prediction for Classification: Performances in Cross-Validation by Example
title_fullStr Receiver Operating Characteristic Prediction for Classification: Performances in Cross-Validation by Example
title_full_unstemmed Receiver Operating Characteristic Prediction for Classification: Performances in Cross-Validation by Example
title_short Receiver Operating Characteristic Prediction for Classification: Performances in Cross-Validation by Example
title_sort receiver operating characteristic prediction for classification performances in cross validation by example
topic receiver operating characteristic
area under the curve
models performances
biomarkers
metastasis
url https://www.mdpi.com/2227-7390/8/10/1741
work_keys_str_mv AT andraciocan receiveroperatingcharacteristicpredictionforclassificationperformancesincrossvalidationbyexample
AT nadimalhajjar receiveroperatingcharacteristicpredictionforclassificationperformancesincrossvalidationbyexample
AT floringraur receiveroperatingcharacteristicpredictionforclassificationperformancesincrossvalidationbyexample
AT valentincoprea receiveroperatingcharacteristicpredictionforclassificationperformancesincrossvalidationbyexample
AT razvanaciocan receiveroperatingcharacteristicpredictionforclassificationperformancesincrossvalidationbyexample
AT soranadbolboaca receiveroperatingcharacteristicpredictionforclassificationperformancesincrossvalidationbyexample