Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory

Abstract Background Rapidly developing tests for emerging diseases is critical for early disease monitoring. In the early stages of an epidemic, when low prevalences are expected, high specificity tests are desired to avoid numerous false positives. Selecting a cutoff to classify positive and negati...

Bibliographic Details
Main Authors:	Sierra Pugh, Bailey K. Fosdick, Mary Nehring, Emily N. Gallichotte, Sue VandeWoude, Ander Wilson
Format:	Article
Language:	English
Published:	BMC 2024-02-01
Series:	BMC Medical Research Methodology
Subjects:	Antibody test; Cut point; Extreme value theory; Serology; Threshold
Online Access:	https://doi.org/10.1186/s12874-023-02139-5

_version_	1797274168933220352
author	Sierra Pugh; Bailey K. Fosdick; Mary Nehring; Emily N. Gallichotte; Sue VandeWoude; Ander Wilson
author_facet	Sierra Pugh; Bailey K. Fosdick; Mary Nehring; Emily N. Gallichotte; Sue VandeWoude; Ander Wilson
author_sort	Sierra Pugh
collection	DOAJ
description	Abstract. Background: Rapidly developing tests for emerging diseases is critical for early disease monitoring. In the early stages of an epidemic, when low prevalence is expected, high-specificity tests are desired to avoid numerous false positives. Selecting a cutoff to classify positive and negative test results that has the desired operating characteristics, such as specificity, is challenging for new tests because of limited validation data with known disease status. While there is ample statistical literature on estimating quantiles of a distribution, there is limited evidence on estimating extreme quantiles from limited validation data and the resulting test characteristics in the disease testing context. Methods: We propose using extreme value theory to select a cutoff with predetermined specificity by fitting a Pareto distribution to the upper tail of the negative controls. We compared this method to five previously proposed cutoff selection methods in a data analysis and simulation study. We analyzed COVID-19 enzyme-linked immunosorbent assay antibody test results from long-term care facilities and skilled nursing staff in Colorado between May and December of 2020. Results: We found the extreme value approach had minimal bias when targeting a specificity of 0.995. Using the empirical quantile of the negative controls performed well when targeting a specificity of 0.95. The higher target specificity is preferred for overall test accuracy when prevalence is low, whereas the lower target specificity is preferred when prevalence is higher and resulted in less variable prevalence estimation. Discussion: While commonly used, the normal-based methods showed considerable bias compared to the empirical and extreme value theory-based methods. Conclusions: When determining disease testing cutoffs from small training data samples, we recommend using the extreme value-based methods when targeting a high specificity and the empirical quantile when targeting a lower specificity.
first_indexed	2024-03-07T14:55:31Z
format	Article
id	doaj.art-6db8c817da294352842da54c0793fa73
institution	Directory of Open Access Journals
issn	1471-2288
language	English
last_indexed	2024-03-07T14:55:31Z
publishDate	2024-02-01
publisher	BMC
record_format	Article
series	BMC Medical Research Methodology
spelling	doaj.art-6db8c817da294352842da54c0793fa73 | 2024-03-05T19:28:28Z | eng | BMC | BMC Medical Research Methodology | 1471-2288 | 2024-02-01 | 24 | 1 | 1 | 14 | 10.1186/s12874-023-02139-5 | Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory | Sierra Pugh (Department of Statistics, Colorado State University); Bailey K. Fosdick (Department of Biostatistics and Informatics, Colorado School of Public Health); Mary Nehring (Department of Microbiology, Immunology, and Pathology, Colorado State University); Emily N. Gallichotte (Department of Microbiology, Immunology, and Pathology, Colorado State University); Sue VandeWoude (Department of Microbiology, Immunology, and Pathology, Colorado State University); Ander Wilson (Department of Statistics, Colorado State University) | https://doi.org/10.1186/s12874-023-02139-5 | Antibody test; Cut point; Extreme value theory; Serology; Threshold
spellingShingle	Sierra Pugh Bailey K. Fosdick Mary Nehring Emily N. Gallichotte Sue VandeWoude Ander Wilson Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory BMC Medical Research Methodology Antibody test Cut point Extreme value theory Serology Threshold
title	Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
title_full	Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
title_fullStr	Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
title_full_unstemmed	Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
title_short	Estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
title_sort	estimating cutoff values for diagnostic tests to achieve target specificity using extreme value theory
topic	Antibody test; Cut point; Extreme value theory; Serology; Threshold
url	https://doi.org/10.1186/s12874-023-02139-5
work_keys_str_mv	AT sierrapugh estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory AT baileykfosdick estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory AT marynehring estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory AT emilyngallichotte estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory AT suevandewoude estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory AT anderwilson estimatingcutoffvaluesfordiagnosticteststoachievetargetspecificityusingextremevaluetheory

Similar Items