Improving feature extraction from histopathological images through a fine-tuning ImageNet model

Background: Due to lack of annotated pathological images, transfer learning has been the predominant approach in the field of digital pathology. Pre-trained neural networks based on ImageNet database are often used to extract “off-the-shelf” features, achieving great success in predicting tissue typ...

Full description

Bibliographic Details
Main Authors:	Xingyu Li, Min Cen, Jinfeng Xu, Hong Zhang, Xu Steven Xu
Format:	Article
Language:	English
Published:	Elsevier 2022-01-01
Series:	Journal of Pathology Informatics
Subjects:	Deep learning Whole slide images TCGA dataset H&E image Colorectal cancer Fine-tuning
Online Access:	http://www.sciencedirect.com/science/article/pii/S215335392200709X

_version_	1797977018091962368
author	Xingyu Li Min Cen Jinfeng Xu Hong Zhang Xu Steven Xu
author_facet	Xingyu Li Min Cen Jinfeng Xu Hong Zhang Xu Steven Xu
author_sort	Xingyu Li
collection	DOAJ
description	Background: Due to lack of annotated pathological images, transfer learning has been the predominant approach in the field of digital pathology. Pre-trained neural networks based on ImageNet database are often used to extract “off-the-shelf” features, achieving great success in predicting tissue types, molecular features, and clinical outcomes, etc. We hypothesize that fine-tuning the pre-trained models using histopathological images could further improve feature extraction, and downstream prediction performance. Methods: We used 100 000 annotated H&E image patches for colorectal cancer (CRC) to fine-tune a pre-trained Xception model via a 2-step approach. The features extracted from fine-tuned Xception (FTX-2048) model and Image-pretrained (IMGNET-2048) model were compared through: (1) tissue classification for H&E images from CRC, same image type that was used for fine-tuning; (2) prediction of immune-related gene expression, and (3) gene mutations for lung adenocarcinoma (LUAD). Five-fold cross validation was used for model performance evaluation. Each experiment was repeated 50 times. Findings: The extracted features from the fine-tuned FTX-2048 exhibited significantly higher accuracy (98.4%) for predicting tissue types of CRC compared to the “off-the-shelf” features directly from Xception based on ImageNet database (96.4%) (P value = 2.2 × 10−6). Particularly, FTX-2048 markedly improved the accuracy for stroma from 87% to 94%. Similarly, features from FTX-2048 boosted the prediction of transcriptomic expression of immune-related genes in LUAD. For the genes that had significant relationships with image features (P < 0.05, n = 171), the features from the fine-tuned model improved the prediction for the majority of the genes (139; 81%). In addition, features from FTX-2048 improved prediction of mutation for 5 out of 9 most frequently mutated genes (STK11, TP53, LRP1B, NF1, and FAT1) in LUAD. Conclusions: We proved the concept that fine-tuning the pretrained ImageNet neural networks with histopathology images can produce higher quality features and better prediction performance for not only the same-cancer tissue classification where similar images from the same cancer are used for fine-tuning, but also cross-cancer prediction for gene expression and mutation at patient level.
first_indexed	2024-04-11T05:00:17Z
format	Article
id	doaj.art-de717a0937ab4f59b208c034d2930cfb
institution	Directory Open Access Journal
issn	2153-3539
language	English
last_indexed	2024-04-11T05:00:17Z
publishDate	2022-01-01
publisher	Elsevier
record_format	Article
series	Journal of Pathology Informatics
spelling	doaj.art-de717a0937ab4f59b208c034d2930cfb2022-12-26T04:08:47ZengElsevierJournal of Pathology Informatics2153-35392022-01-0113100115Improving feature extraction from histopathological images through a fine-tuning ImageNet modelXingyu Li0Min Cen1Jinfeng Xu2Hong Zhang3Xu Steven Xu4Department of Statistics and Finance, School of Management, University of Science and Technology of China, Hefei, Anhui 230026, ChinaDepartment of Statistics and Finance, School of Management, University of Science and Technology of China, Hefei, Anhui 230026, ChinaDepartment of Statistics and Actuarial Science, The University of Hong Kong, Hong KongDepartment of Statistics and Finance, School of Management, University of Science and Technology of China, Hefei, Anhui 230026, China; Corresponding authors.Clinical Pharmacology and Quantitative Science, Genmab Inc., Princeton, New Jersey, USA; Corresponding authors.Background: Due to lack of annotated pathological images, transfer learning has been the predominant approach in the field of digital pathology. Pre-trained neural networks based on ImageNet database are often used to extract “off-the-shelf” features, achieving great success in predicting tissue types, molecular features, and clinical outcomes, etc. We hypothesize that fine-tuning the pre-trained models using histopathological images could further improve feature extraction, and downstream prediction performance. Methods: We used 100 000 annotated H&E image patches for colorectal cancer (CRC) to fine-tune a pre-trained Xception model via a 2-step approach. The features extracted from fine-tuned Xception (FTX-2048) model and Image-pretrained (IMGNET-2048) model were compared through: (1) tissue classification for H&E images from CRC, same image type that was used for fine-tuning; (2) prediction of immune-related gene expression, and (3) gene mutations for lung adenocarcinoma (LUAD). Five-fold cross validation was used for model performance evaluation. Each experiment was repeated 50 times. Findings: The extracted features from the fine-tuned FTX-2048 exhibited significantly higher accuracy (98.4%) for predicting tissue types of CRC compared to the “off-the-shelf” features directly from Xception based on ImageNet database (96.4%) (P value = 2.2 × 10−6). Particularly, FTX-2048 markedly improved the accuracy for stroma from 87% to 94%. Similarly, features from FTX-2048 boosted the prediction of transcriptomic expression of immune-related genes in LUAD. For the genes that had significant relationships with image features (P < 0.05, n = 171), the features from the fine-tuned model improved the prediction for the majority of the genes (139; 81%). In addition, features from FTX-2048 improved prediction of mutation for 5 out of 9 most frequently mutated genes (STK11, TP53, LRP1B, NF1, and FAT1) in LUAD. Conclusions: We proved the concept that fine-tuning the pretrained ImageNet neural networks with histopathology images can produce higher quality features and better prediction performance for not only the same-cancer tissue classification where similar images from the same cancer are used for fine-tuning, but also cross-cancer prediction for gene expression and mutation at patient level.http://www.sciencedirect.com/science/article/pii/S215335392200709XDeep learningWhole slide imagesTCGA datasetH&E imageColorectal cancerFine-tuning
spellingShingle	Xingyu Li Min Cen Jinfeng Xu Hong Zhang Xu Steven Xu Improving feature extraction from histopathological images through a fine-tuning ImageNet model Journal of Pathology Informatics Deep learning Whole slide images TCGA dataset H&E image Colorectal cancer Fine-tuning
title	Improving feature extraction from histopathological images through a fine-tuning ImageNet model
title_full	Improving feature extraction from histopathological images through a fine-tuning ImageNet model
title_fullStr	Improving feature extraction from histopathological images through a fine-tuning ImageNet model
title_full_unstemmed	Improving feature extraction from histopathological images through a fine-tuning ImageNet model
title_short	Improving feature extraction from histopathological images through a fine-tuning ImageNet model
title_sort	improving feature extraction from histopathological images through a fine tuning imagenet model
topic	Deep learning Whole slide images TCGA dataset H&E image Colorectal cancer Fine-tuning
url	http://www.sciencedirect.com/science/article/pii/S215335392200709X
work_keys_str_mv	AT xingyuli improvingfeatureextractionfromhistopathologicalimagesthroughafinetuningimagenetmodel AT mincen improvingfeatureextractionfromhistopathologicalimagesthroughafinetuningimagenetmodel AT jinfengxu improvingfeatureextractionfromhistopathologicalimagesthroughafinetuningimagenetmodel AT hongzhang improvingfeatureextractionfromhistopathologicalimagesthroughafinetuningimagenetmodel AT xustevenxu improvingfeatureextractionfromhistopathologicalimagesthroughafinetuningimagenetmodel

Improving feature extraction from histopathological images through a fine-tuning ImageNet model

Similar Items