Optimal threshold estimation for binary classifiers using game theory [version 3; referees: 2 approved, 1 approved with reservations]

Many bioinformatics algorithms can be understood as binary classifiers. They are usually compared using the area under the receiver operating characteristic (ROC) curve. On the other hand, choosing the best threshold for practical use is a complex task, due to uncertain and context-dependent skews i...

Full description

Bibliographic Details
Main Author: Ignacio Enrique Sanchez
Format: Article
Language:English
Published: F1000 Research Ltd 2017-02-01
Series:F1000Research
Subjects:
Online Access:https://f1000research.com/articles/5-2762/v3
_version_ 1818326568458518528
author Ignacio Enrique Sanchez
author_facet Ignacio Enrique Sanchez
author_sort Ignacio Enrique Sanchez
collection DOAJ
description Many bioinformatics algorithms can be understood as binary classifiers. They are usually compared using the area under the receiver operating characteristic (ROC) curve. On the other hand, choosing the best threshold for practical use is a complex task, due to uncertain and context-dependent skews in the abundance of positives in nature and in the yields/costs for correct/incorrect classification. We argue that considering a classifier as a player in a zero-sum game allows us to use the minimax principle from game theory to determine the optimal operating point. The proposed classifier threshold corresponds to the intersection between the ROC curve and the descending diagonal in ROC space and yields a minimax accuracy of 1-FPR. Our proposal can be readily implemented in practice, and reveals that the empirical condition for threshold estimation of “specificity equals sensitivity” maximizes robustness against uncertainties in the abundance of positives in nature and classification costs.
first_indexed 2024-12-13T12:02:27Z
format Article
id doaj.art-61a6571dd5ba49c3b62f73cc84e5a34d
institution Directory Open Access Journal
issn 2046-1402
language English
last_indexed 2024-12-13T12:02:27Z
publishDate 2017-02-01
publisher F1000 Research Ltd
record_format Article
series F1000Research
spelling doaj.art-61a6571dd5ba49c3b62f73cc84e5a34d2022-12-21T23:47:04ZengF1000 Research LtdF1000Research2046-14022017-02-01510.12688/f1000research.10114.311399Optimal threshold estimation for binary classifiers using game theory [version 3; referees: 2 approved, 1 approved with reservations]Ignacio Enrique Sanchez0Protein Physiology Laboratory, University of Buenos Aires, Buenos Aires, ArgentinaMany bioinformatics algorithms can be understood as binary classifiers. They are usually compared using the area under the receiver operating characteristic (ROC) curve. On the other hand, choosing the best threshold for practical use is a complex task, due to uncertain and context-dependent skews in the abundance of positives in nature and in the yields/costs for correct/incorrect classification. We argue that considering a classifier as a player in a zero-sum game allows us to use the minimax principle from game theory to determine the optimal operating point. The proposed classifier threshold corresponds to the intersection between the ROC curve and the descending diagonal in ROC space and yields a minimax accuracy of 1-FPR. Our proposal can be readily implemented in practice, and reveals that the empirical condition for threshold estimation of “specificity equals sensitivity” maximizes robustness against uncertainties in the abundance of positives in nature and classification costs.https://f1000research.com/articles/5-2762/v3Bioinformatics
spellingShingle Ignacio Enrique Sanchez
Optimal threshold estimation for binary classifiers using game theory [version 3; referees: 2 approved, 1 approved with reservations]
F1000Research
Bioinformatics
title Optimal threshold estimation for binary classifiers using game theory [version 3; referees: 2 approved, 1 approved with reservations]
title_full Optimal threshold estimation for binary classifiers using game theory [version 3; referees: 2 approved, 1 approved with reservations]
title_fullStr Optimal threshold estimation for binary classifiers using game theory [version 3; referees: 2 approved, 1 approved with reservations]
title_full_unstemmed Optimal threshold estimation for binary classifiers using game theory [version 3; referees: 2 approved, 1 approved with reservations]
title_short Optimal threshold estimation for binary classifiers using game theory [version 3; referees: 2 approved, 1 approved with reservations]
title_sort optimal threshold estimation for binary classifiers using game theory version 3 referees 2 approved 1 approved with reservations
topic Bioinformatics
url https://f1000research.com/articles/5-2762/v3
work_keys_str_mv AT ignacioenriquesanchez optimalthresholdestimationforbinaryclassifiersusinggametheoryversion3referees2approved1approvedwithreservations