Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory

Abstract Background Rapidly developing tests for emerging diseases is critical for early disease monitoring. In the early stages of an epidemic, when low prevalences are expected, high specificity tests are desired to avoid numerous false positives. Selecting a cutoff to classify positive and negati...

Full description

Bibliographic Details
Main Authors: Sierra Pugh, Bailey K. Fosdick, Mary Nehring, Emily N. Gallichotte, Sue VandeWoude, Ander Wilson
Format: Article
Language:English
Published: BMC 2024-02-01
Series:BMC Medical Research Methodology
Subjects:
Online Access:https://doi.org/10.1186/s12874-023-02139-5
_version_ 1797274168933220352
author Sierra Pugh
Bailey K. Fosdick
Mary Nehring
Emily N. Gallichotte
Sue VandeWoude
Ander Wilson
author_facet Sierra Pugh
Bailey K. Fosdick
Mary Nehring
Emily N. Gallichotte
Sue VandeWoude
Ander Wilson
author_sort Sierra Pugh
collection DOAJ
description Abstract Background Rapidly developing tests for emerging diseases is critical for early disease monitoring. In the early stages of an epidemic, when low prevalences are expected, high specificity tests are desired to avoid numerous false positives. Selecting a cutoff to classify positive and negative test results that has the desired operating characteristics, such as specificity, is challenging for new tests because of limited validation data with known disease status. While there is ample statistical literature on estimating quantiles of a distribution, there is limited evidence on estimating extreme quantiles from limited validation data and the resulting test characteristics in the disease testing context. Methods We propose using extreme value theory to select a cutoff with predetermined specificity by fitting a Pareto distribution to the upper tail of the negative controls. We compared this method to five previously proposed cutoff selection methods in a data analysis and simulation study. We analyzed COVID-19 enzyme linked immunosorbent assay antibody test results from long-term care facilities and skilled nursing staff in Colorado between May and December of 2020. Results We found the extreme value approach had minimal bias when targeting a specificity of 0.995. Using the empirical quantile of the negative controls performed well when targeting a specificity of 0.95. The higher target specificity is preferred for overall test accuracy when prevalence is low, whereas the lower target specificity is preferred when prevalence is higher and resulted in less variable prevalence estimation. Discussion While commonly used, the normal based methods showed considerable bias compared to the empirical and extreme value theory-based methods. Conclusions When determining disease testing cutoffs from small training data samples, we recommend using the extreme value based-methods when targeting a high specificity and the empirical quantile when targeting a lower specificity.
first_indexed 2024-03-07T14:55:31Z
format Article
id doaj.art-6db8c817da294352842da54c0793fa73
institution Directory Open Access Journal
issn 1471-2288
language English
last_indexed 2024-03-07T14:55:31Z
publishDate 2024-02-01
publisher BMC
record_format Article
series BMC Medical Research Methodology
spelling doaj.art-6db8c817da294352842da54c0793fa732024-03-05T19:28:28ZengBMCBMC Medical Research Methodology1471-22882024-02-0124111410.1186/s12874-023-02139-5Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theorySierra Pugh0Bailey K. Fosdick1Mary Nehring2Emily N. Gallichotte3Sue VandeWoude4Ander Wilson5Department of Statistics, Colorado State UniversityDepartment of Biostatistics and Informatics, Colorado School of Public HealthDepartment of Microbiology, Immunology, and Pathology, Colorado State UniversityDepartment of Microbiology, Immunology, and Pathology, Colorado State UniversityDepartment of Microbiology, Immunology, and Pathology, Colorado State UniversityDepartment of Statistics, Colorado State UniversityAbstract Background Rapidly developing tests for emerging diseases is critical for early disease monitoring. In the early stages of an epidemic, when low prevalences are expected, high specificity tests are desired to avoid numerous false positives. Selecting a cutoff to classify positive and negative test results that has the desired operating characteristics, such as specificity, is challenging for new tests because of limited validation data with known disease status. While there is ample statistical literature on estimating quantiles of a distribution, there is limited evidence on estimating extreme quantiles from limited validation data and the resulting test characteristics in the disease testing context. Methods We propose using extreme value theory to select a cutoff with predetermined specificity by fitting a Pareto distribution to the upper tail of the negative controls. We compared this method to five previously proposed cutoff selection methods in a data analysis and simulation study. We analyzed COVID-19 enzyme linked immunosorbent assay antibody test results from long-term care facilities and skilled nursing staff in Colorado between May and December of 2020. Results We found the extreme value approach had minimal bias when targeting a specificity of 0.995. Using the empirical quantile of the negative controls performed well when targeting a specificity of 0.95. The higher target specificity is preferred for overall test accuracy when prevalence is low, whereas the lower target specificity is preferred when prevalence is higher and resulted in less variable prevalence estimation. Discussion While commonly used, the normal based methods showed considerable bias compared to the empirical and extreme value theory-based methods. Conclusions When determining disease testing cutoffs from small training data samples, we recommend using the extreme value based-methods when targeting a high specificity and the empirical quantile when targeting a lower specificity.https://doi.org/10.1186/s12874-023-02139-5Antibody testCut pointExtreme value theorySerologyThreshold
spellingShingle Sierra Pugh
Bailey K. Fosdick
Mary Nehring
Emily N. Gallichotte
Sue VandeWoude
Ander Wilson
Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
BMC Medical Research Methodology
Antibody test
Cut point
Extreme value theory
Serology
Threshold
title Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
title_full Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
title_fullStr Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
title_full_unstemmed Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
title_short Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
title_sort estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
topic Antibody test
Cut point
Extreme value theory
Serology
Threshold
url https://doi.org/10.1186/s12874-023-02139-5
work_keys_str_mv AT sierrapugh estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory
AT baileykfosdick estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory
AT marynehring estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory
AT emilyngallichotte estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory
AT suevandewoude estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory
AT anderwilson estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory