Optimization of deep learning models for the prediction of gene mutations using unsupervised clustering

Abstract Deep learning models are increasingly being used to interpret whole‐slide images (WSIs) in digital pathology and to predict genetic mutations. Currently, it is commonly assumed that tumor regions have most of the predictive power. However, it is reasonable to assume that other tissues from...

Full description

Bibliographic Details
Main Authors: Zihan Chen, Xingyu Li, Miaomiao Yang, Hong Zhang, Xu Steven Xu
Format: Article
Language:English
Published: Wiley 2023-01-01
Series:The Journal of Pathology: Clinical Research
Subjects:
Online Access:https://doi.org/10.1002/cjp2.302
_version_ 1811178385126195200
author Zihan Chen
Xingyu Li
Miaomiao Yang
Hong Zhang
Xu Steven Xu
author_facet Zihan Chen
Xingyu Li
Miaomiao Yang
Hong Zhang
Xu Steven Xu
author_sort Zihan Chen
collection DOAJ
description Abstract Deep learning models are increasingly being used to interpret whole‐slide images (WSIs) in digital pathology and to predict genetic mutations. Currently, it is commonly assumed that tumor regions have most of the predictive power. However, it is reasonable to assume that other tissues from the tumor microenvironment may also provide important predictive information. In this paper, we propose an unsupervised clustering‐based multiple‐instance deep learning model for the prediction of genetic mutations using WSIs of three cancer types obtained from The Cancer Genome Atlas. Our proposed model facilitates the identification of spatial regions related to specific gene mutations and exclusion of patches that lack predictive information through the use of unsupervised clustering. This results in a more accurate prediction of gene mutations when compared with models using all image patches on WSIs and two recently published algorithms for all three different cancer types evaluated in this study. In addition, our study validates the hypothesis that the prediction of gene mutations solely based on tumor regions on WSI slides may not always provide the best performance. Other tissue types in the tumor microenvironment could provide a better prediction ability than tumor tissues alone. These results highlight the heterogeneity in the tumor microenvironment and the importance of identification of predictive image patches in digital pathology prediction tasks.
first_indexed 2024-04-11T06:17:15Z
format Article
id doaj.art-41e9159432ab4ee2afb49f3a95e5d01d
institution Directory Open Access Journal
issn 2056-4538
language English
last_indexed 2024-04-11T06:17:15Z
publishDate 2023-01-01
publisher Wiley
record_format Article
series The Journal of Pathology: Clinical Research
spelling doaj.art-41e9159432ab4ee2afb49f3a95e5d01d2022-12-22T04:41:01ZengWileyThe Journal of Pathology: Clinical Research2056-45382023-01-019131710.1002/cjp2.302Optimization of deep learning models for the prediction of gene mutations using unsupervised clusteringZihan Chen0Xingyu Li1Miaomiao Yang2Hong Zhang3Xu Steven Xu4School of Data Science University of Science and Technology of China Hefei PR ChinaDepartment of Statistics and Finance, School of Management University of Science and Technology of China Hefei PR ChinaClinical Pathology Center The Fourth Affiliated Hospital of Anhui Medical University Hefei PR ChinaDepartment of Statistics and Finance, School of Management University of Science and Technology of China Hefei PR ChinaClinical Pharmacology and Quantitative Science Genmab Inc. Princeton NJ USAAbstract Deep learning models are increasingly being used to interpret whole‐slide images (WSIs) in digital pathology and to predict genetic mutations. Currently, it is commonly assumed that tumor regions have most of the predictive power. However, it is reasonable to assume that other tissues from the tumor microenvironment may also provide important predictive information. In this paper, we propose an unsupervised clustering‐based multiple‐instance deep learning model for the prediction of genetic mutations using WSIs of three cancer types obtained from The Cancer Genome Atlas. Our proposed model facilitates the identification of spatial regions related to specific gene mutations and exclusion of patches that lack predictive information through the use of unsupervised clustering. This results in a more accurate prediction of gene mutations when compared with models using all image patches on WSIs and two recently published algorithms for all three different cancer types evaluated in this study. In addition, our study validates the hypothesis that the prediction of gene mutations solely based on tumor regions on WSI slides may not always provide the best performance. Other tissue types in the tumor microenvironment could provide a better prediction ability than tumor tissues alone. These results highlight the heterogeneity in the tumor microenvironment and the importance of identification of predictive image patches in digital pathology prediction tasks.https://doi.org/10.1002/cjp2.302deep learningwhole‐slide imagesH&E imagegene mutationunsupervised clustering
spellingShingle Zihan Chen
Xingyu Li
Miaomiao Yang
Hong Zhang
Xu Steven Xu
Optimization of deep learning models for the prediction of gene mutations using unsupervised clustering
The Journal of Pathology: Clinical Research
deep learning
whole‐slide images
H&E image
gene mutation
unsupervised clustering
title Optimization of deep learning models for the prediction of gene mutations using unsupervised clustering
title_full Optimization of deep learning models for the prediction of gene mutations using unsupervised clustering
title_fullStr Optimization of deep learning models for the prediction of gene mutations using unsupervised clustering
title_full_unstemmed Optimization of deep learning models for the prediction of gene mutations using unsupervised clustering
title_short Optimization of deep learning models for the prediction of gene mutations using unsupervised clustering
title_sort optimization of deep learning models for the prediction of gene mutations using unsupervised clustering
topic deep learning
whole‐slide images
H&E image
gene mutation
unsupervised clustering
url https://doi.org/10.1002/cjp2.302
work_keys_str_mv AT zihanchen optimizationofdeeplearningmodelsforthepredictionofgenemutationsusingunsupervisedclustering
AT xingyuli optimizationofdeeplearningmodelsforthepredictionofgenemutationsusingunsupervisedclustering
AT miaomiaoyang optimizationofdeeplearningmodelsforthepredictionofgenemutationsusingunsupervisedclustering
AT hongzhang optimizationofdeeplearningmodelsforthepredictionofgenemutationsusingunsupervisedclustering
AT xustevenxu optimizationofdeeplearningmodelsforthepredictionofgenemutationsusingunsupervisedclustering