Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence

Incidence and mortality rates of endometrial cancer are increasing, leading to increased interest in endometrial cancer risk prediction and stratification to help in screening and prevention. Previous risk models have had moderate success with the area under the curve (AUC) ranging from 0.68 to 0.77...

Full description

Bibliographic Details
Main Authors: Gregory R. Hart, Vanessa Yan, Gloria S. Huang, Ying Liang, Bradley J. Nartowt, Wazir Muhammad, Jun Deng
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-11-01
Series:Frontiers in Artificial Intelligence
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/frai.2020.539879/full
_version_ 1818589920791363584
author Gregory R. Hart
Vanessa Yan
Gloria S. Huang
Ying Liang
Bradley J. Nartowt
Wazir Muhammad
Jun Deng
author_facet Gregory R. Hart
Vanessa Yan
Gloria S. Huang
Ying Liang
Bradley J. Nartowt
Wazir Muhammad
Jun Deng
author_sort Gregory R. Hart
collection DOAJ
description Incidence and mortality rates of endometrial cancer are increasing, leading to increased interest in endometrial cancer risk prediction and stratification to help in screening and prevention. Previous risk models have had moderate success with the area under the curve (AUC) ranging from 0.68 to 0.77. Here we demonstrate a population-based machine learning model for endometrial cancer screening that achieves a testing AUC of 0.96.We train seven machine learning algorithms based solely on personal health data, without any genomic, imaging, biomarkers, or invasive procedures. The data come from the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (PLCO). We further compare our machine learning model with 15 gynecologic oncologists and primary care physicians in the stratification of endometrial cancer risk for 100 women.We find a random forest model that achieves a testing AUC of 0.96 and a neural network model that achieves a testing AUC of 0.91. We test both models in risk stratification against 15 practicing physicians. Our random forest model is 2.5 times better at identifying above-average risk women with a 2-fold reduction in the false positive rate. Our neural network model is 2 times better at identifying above-average risk women with a 3-fold reduction in the false positive rate.Our machine learning models provide a non-invasive and cost-effective way to identify high-risk sub-populations who may benefit from early screening of endometrial cancer, prior to disease onset. Through statistical biopsy of personal health data, we have identified a new and effective approach for early cancer detection and prevention for individual patients.
first_indexed 2024-12-16T09:48:19Z
format Article
id doaj.art-e02f06417af44d92981931580d506054
institution Directory Open Access Journal
issn 2624-8212
language English
last_indexed 2024-12-16T09:48:19Z
publishDate 2020-11-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Artificial Intelligence
spelling doaj.art-e02f06417af44d92981931580d5060542022-12-21T22:36:08ZengFrontiers Media S.A.Frontiers in Artificial Intelligence2624-82122020-11-01310.3389/frai.2020.539879539879Population-Based Screening for Endometrial Cancer: Human vs. Machine IntelligenceGregory R. Hart0Vanessa Yan1Gloria S. Huang2Ying Liang3Bradley J. Nartowt4Wazir Muhammad5Jun Deng6Department of Therapeutic Radiology, Yale University, New Haven, CT, U.S.ADepartment of Statistics and Data Science, Yale University, New Haven, CT, U.S.ADepartment of Obstetrics, Gynecology and Reproductive Sciences, Yale University, New Haven, CT, U.S.ADepartment of Therapeutic Radiology, Yale University, New Haven, CT, U.S.ADepartment of Therapeutic Radiology, Yale University, New Haven, CT, U.S.ADepartment of Therapeutic Radiology, Yale University, New Haven, CT, U.S.ADepartment of Therapeutic Radiology, Yale University, New Haven, CT, U.S.AIncidence and mortality rates of endometrial cancer are increasing, leading to increased interest in endometrial cancer risk prediction and stratification to help in screening and prevention. Previous risk models have had moderate success with the area under the curve (AUC) ranging from 0.68 to 0.77. Here we demonstrate a population-based machine learning model for endometrial cancer screening that achieves a testing AUC of 0.96.We train seven machine learning algorithms based solely on personal health data, without any genomic, imaging, biomarkers, or invasive procedures. The data come from the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (PLCO). We further compare our machine learning model with 15 gynecologic oncologists and primary care physicians in the stratification of endometrial cancer risk for 100 women.We find a random forest model that achieves a testing AUC of 0.96 and a neural network model that achieves a testing AUC of 0.91. We test both models in risk stratification against 15 practicing physicians. Our random forest model is 2.5 times better at identifying above-average risk women with a 2-fold reduction in the false positive rate. Our neural network model is 2 times better at identifying above-average risk women with a 3-fold reduction in the false positive rate.Our machine learning models provide a non-invasive and cost-effective way to identify high-risk sub-populations who may benefit from early screening of endometrial cancer, prior to disease onset. Through statistical biopsy of personal health data, we have identified a new and effective approach for early cancer detection and prevention for individual patients.https://www.frontiersin.org/articles/10.3389/frai.2020.539879/fullendometrial cancercancer screeningearly detectionmachine learningstatistical biopsy
spellingShingle Gregory R. Hart
Vanessa Yan
Gloria S. Huang
Ying Liang
Bradley J. Nartowt
Wazir Muhammad
Jun Deng
Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence
Frontiers in Artificial Intelligence
endometrial cancer
cancer screening
early detection
machine learning
statistical biopsy
title Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence
title_full Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence
title_fullStr Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence
title_full_unstemmed Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence
title_short Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence
title_sort population based screening for endometrial cancer human vs machine intelligence
topic endometrial cancer
cancer screening
early detection
machine learning
statistical biopsy
url https://www.frontiersin.org/articles/10.3389/frai.2020.539879/full
work_keys_str_mv AT gregoryrhart populationbasedscreeningforendometrialcancerhumanvsmachineintelligence
AT vanessayan populationbasedscreeningforendometrialcancerhumanvsmachineintelligence
AT gloriashuang populationbasedscreeningforendometrialcancerhumanvsmachineintelligence
AT yingliang populationbasedscreeningforendometrialcancerhumanvsmachineintelligence
AT bradleyjnartowt populationbasedscreeningforendometrialcancerhumanvsmachineintelligence
AT wazirmuhammad populationbasedscreeningforendometrialcancerhumanvsmachineintelligence
AT jundeng populationbasedscreeningforendometrialcancerhumanvsmachineintelligence