Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence
Incidence and mortality rates of endometrial cancer are increasing, leading to increased interest in endometrial cancer risk prediction and stratification to help in screening and prevention. Previous risk models have had moderate success with the area under the curve (AUC) ranging from 0.68 to 0.77...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2020-11-01
|
Series: | Frontiers in Artificial Intelligence |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/frai.2020.539879/full |
_version_ | 1818589920791363584 |
---|---|
author | Gregory R. Hart Vanessa Yan Gloria S. Huang Ying Liang Bradley J. Nartowt Wazir Muhammad Jun Deng |
author_facet | Gregory R. Hart Vanessa Yan Gloria S. Huang Ying Liang Bradley J. Nartowt Wazir Muhammad Jun Deng |
author_sort | Gregory R. Hart |
collection | DOAJ |
description | Incidence and mortality rates of endometrial cancer are increasing, leading to increased interest in endometrial cancer risk prediction and stratification to help in screening and prevention. Previous risk models have had moderate success with the area under the curve (AUC) ranging from 0.68 to 0.77. Here we demonstrate a population-based machine learning model for endometrial cancer screening that achieves a testing AUC of 0.96.We train seven machine learning algorithms based solely on personal health data, without any genomic, imaging, biomarkers, or invasive procedures. The data come from the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (PLCO). We further compare our machine learning model with 15 gynecologic oncologists and primary care physicians in the stratification of endometrial cancer risk for 100 women.We find a random forest model that achieves a testing AUC of 0.96 and a neural network model that achieves a testing AUC of 0.91. We test both models in risk stratification against 15 practicing physicians. Our random forest model is 2.5 times better at identifying above-average risk women with a 2-fold reduction in the false positive rate. Our neural network model is 2 times better at identifying above-average risk women with a 3-fold reduction in the false positive rate.Our machine learning models provide a non-invasive and cost-effective way to identify high-risk sub-populations who may benefit from early screening of endometrial cancer, prior to disease onset. Through statistical biopsy of personal health data, we have identified a new and effective approach for early cancer detection and prevention for individual patients. |
first_indexed | 2024-12-16T09:48:19Z |
format | Article |
id | doaj.art-e02f06417af44d92981931580d506054 |
institution | Directory Open Access Journal |
issn | 2624-8212 |
language | English |
last_indexed | 2024-12-16T09:48:19Z |
publishDate | 2020-11-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Artificial Intelligence |
spelling | doaj.art-e02f06417af44d92981931580d5060542022-12-21T22:36:08ZengFrontiers Media S.A.Frontiers in Artificial Intelligence2624-82122020-11-01310.3389/frai.2020.539879539879Population-Based Screening for Endometrial Cancer: Human vs. Machine IntelligenceGregory R. Hart0Vanessa Yan1Gloria S. Huang2Ying Liang3Bradley J. Nartowt4Wazir Muhammad5Jun Deng6Department of Therapeutic Radiology, Yale University, New Haven, CT, U.S.ADepartment of Statistics and Data Science, Yale University, New Haven, CT, U.S.ADepartment of Obstetrics, Gynecology and Reproductive Sciences, Yale University, New Haven, CT, U.S.ADepartment of Therapeutic Radiology, Yale University, New Haven, CT, U.S.ADepartment of Therapeutic Radiology, Yale University, New Haven, CT, U.S.ADepartment of Therapeutic Radiology, Yale University, New Haven, CT, U.S.ADepartment of Therapeutic Radiology, Yale University, New Haven, CT, U.S.AIncidence and mortality rates of endometrial cancer are increasing, leading to increased interest in endometrial cancer risk prediction and stratification to help in screening and prevention. Previous risk models have had moderate success with the area under the curve (AUC) ranging from 0.68 to 0.77. Here we demonstrate a population-based machine learning model for endometrial cancer screening that achieves a testing AUC of 0.96.We train seven machine learning algorithms based solely on personal health data, without any genomic, imaging, biomarkers, or invasive procedures. The data come from the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (PLCO). We further compare our machine learning model with 15 gynecologic oncologists and primary care physicians in the stratification of endometrial cancer risk for 100 women.We find a random forest model that achieves a testing AUC of 0.96 and a neural network model that achieves a testing AUC of 0.91. We test both models in risk stratification against 15 practicing physicians. Our random forest model is 2.5 times better at identifying above-average risk women with a 2-fold reduction in the false positive rate. Our neural network model is 2 times better at identifying above-average risk women with a 3-fold reduction in the false positive rate.Our machine learning models provide a non-invasive and cost-effective way to identify high-risk sub-populations who may benefit from early screening of endometrial cancer, prior to disease onset. Through statistical biopsy of personal health data, we have identified a new and effective approach for early cancer detection and prevention for individual patients.https://www.frontiersin.org/articles/10.3389/frai.2020.539879/fullendometrial cancercancer screeningearly detectionmachine learningstatistical biopsy |
spellingShingle | Gregory R. Hart Vanessa Yan Gloria S. Huang Ying Liang Bradley J. Nartowt Wazir Muhammad Jun Deng Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence Frontiers in Artificial Intelligence endometrial cancer cancer screening early detection machine learning statistical biopsy |
title | Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence |
title_full | Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence |
title_fullStr | Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence |
title_full_unstemmed | Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence |
title_short | Population-Based Screening for Endometrial Cancer: Human vs. Machine Intelligence |
title_sort | population based screening for endometrial cancer human vs machine intelligence |
topic | endometrial cancer cancer screening early detection machine learning statistical biopsy |
url | https://www.frontiersin.org/articles/10.3389/frai.2020.539879/full |
work_keys_str_mv | AT gregoryrhart populationbasedscreeningforendometrialcancerhumanvsmachineintelligence AT vanessayan populationbasedscreeningforendometrialcancerhumanvsmachineintelligence AT gloriashuang populationbasedscreeningforendometrialcancerhumanvsmachineintelligence AT yingliang populationbasedscreeningforendometrialcancerhumanvsmachineintelligence AT bradleyjnartowt populationbasedscreeningforendometrialcancerhumanvsmachineintelligence AT wazirmuhammad populationbasedscreeningforendometrialcancerhumanvsmachineintelligence AT jundeng populationbasedscreeningforendometrialcancerhumanvsmachineintelligence |