Breast cancer diagnosis through knowledge distillation of Swin transformer-based teacher–student models

Breast cancer is a significant global health concern, emphasizing the crucial need for a timely and accurate diagnosis to enhance survival rates. Traditional diagnostic methods rely on pathologists analyzing whole-slide images (WSIs) to identify and diagnose malignancies. However, this task is compl...

Full description

Bibliographic Details
Main Authors: Bhavannarayanna Kolla, Venugopal P
Format: Article
Language:English
Published: IOP Publishing 2023-01-01
Series:Machine Learning: Science and Technology
Subjects:
Online Access:https://doi.org/10.1088/2632-2153/ad10cc
_version_ 1797401538218426368
author Bhavannarayanna Kolla
Venugopal P
author_facet Bhavannarayanna Kolla
Venugopal P
author_sort Bhavannarayanna Kolla
collection DOAJ
description Breast cancer is a significant global health concern, emphasizing the crucial need for a timely and accurate diagnosis to enhance survival rates. Traditional diagnostic methods rely on pathologists analyzing whole-slide images (WSIs) to identify and diagnose malignancies. However, this task is complex, demanding specialized expertise and imposing a substantial workload on pathologists. Additionally, existing deep learning models, commonly employed for classifying histopathology images, often need enhancements to ensure their suitability for real-time deployment on WSI, especially when trained for small regions of interest (ROIs). This article introduces two Swin transformer-based architectures: the teacher model, characterized by its moderate size, and the lightweight student model. Both models are trained using a publicly available dataset of breast cancer histopathology images, focusing on ROIs with varying magnification factors. Transfer learning is applied to train the teacher model, and knowledge distillation (KD) transfers its capabilities to the student model. To enhance validation accuracy and minimize the total loss in KD, we employ the state–action–reward–state–action (SARSA) reinforcement learning algorithm. The algorithm dynamically computes temperature and a weighting factor throughout the KD process to achieve high accuracy within a considerably shorter training timeframe. Additionally, the student model is deployed to analyze malignancies in WSI. Despite the student model being only one-third the size and flops of the teacher model, it achieves an impressive accuracy of 98.71%, slightly below the teacher’s accuracy of 98.91%. Experimental results demonstrate that the student model can process WSIs at a throughput of 1.67 samples s ^−1 with an accuracy of 82%. The proposed student model, trained using KD and the SARSA algorithm, exhibits promising breast cancer classification and WSI analysis performance. These findings indicate its potential for assisting pathologists in diagnosing breast cancer accurately and effectively.
first_indexed 2024-03-09T02:11:32Z
format Article
id doaj.art-3bae98e88cba4ab7b0ee809a79c8e285
institution Directory Open Access Journal
issn 2632-2153
language English
last_indexed 2024-03-09T02:11:32Z
publishDate 2023-01-01
publisher IOP Publishing
record_format Article
series Machine Learning: Science and Technology
spelling doaj.art-3bae98e88cba4ab7b0ee809a79c8e2852023-12-07T09:25:30ZengIOP PublishingMachine Learning: Science and Technology2632-21532023-01-014404504710.1088/2632-2153/ad10ccBreast cancer diagnosis through knowledge distillation of Swin transformer-based teacher–student modelsBhavannarayanna Kolla0https://orcid.org/0000-0003-1490-5962Venugopal P1https://orcid.org/0000-0002-5608-8414School of Electronics Engineering, Vellore Institute of Technology , Vellore 632014, Tamil Nadu, IndiaSchool of Electronics Engineering, Vellore Institute of Technology , Vellore 632014, Tamil Nadu, IndiaBreast cancer is a significant global health concern, emphasizing the crucial need for a timely and accurate diagnosis to enhance survival rates. Traditional diagnostic methods rely on pathologists analyzing whole-slide images (WSIs) to identify and diagnose malignancies. However, this task is complex, demanding specialized expertise and imposing a substantial workload on pathologists. Additionally, existing deep learning models, commonly employed for classifying histopathology images, often need enhancements to ensure their suitability for real-time deployment on WSI, especially when trained for small regions of interest (ROIs). This article introduces two Swin transformer-based architectures: the teacher model, characterized by its moderate size, and the lightweight student model. Both models are trained using a publicly available dataset of breast cancer histopathology images, focusing on ROIs with varying magnification factors. Transfer learning is applied to train the teacher model, and knowledge distillation (KD) transfers its capabilities to the student model. To enhance validation accuracy and minimize the total loss in KD, we employ the state–action–reward–state–action (SARSA) reinforcement learning algorithm. The algorithm dynamically computes temperature and a weighting factor throughout the KD process to achieve high accuracy within a considerably shorter training timeframe. Additionally, the student model is deployed to analyze malignancies in WSI. Despite the student model being only one-third the size and flops of the teacher model, it achieves an impressive accuracy of 98.71%, slightly below the teacher’s accuracy of 98.91%. Experimental results demonstrate that the student model can process WSIs at a throughput of 1.67 samples s ^−1 with an accuracy of 82%. The proposed student model, trained using KD and the SARSA algorithm, exhibits promising breast cancer classification and WSI analysis performance. These findings indicate its potential for assisting pathologists in diagnosing breast cancer accurately and effectively.https://doi.org/10.1088/2632-2153/ad10ccteacher modelstudent modelSwin-transformerstransfer learningknowledge distillationbreast cancer histopathology
spellingShingle Bhavannarayanna Kolla
Venugopal P
Breast cancer diagnosis through knowledge distillation of Swin transformer-based teacher–student models
Machine Learning: Science and Technology
teacher model
student model
Swin-transformers
transfer learning
knowledge distillation
breast cancer histopathology
title Breast cancer diagnosis through knowledge distillation of Swin transformer-based teacher–student models
title_full Breast cancer diagnosis through knowledge distillation of Swin transformer-based teacher–student models
title_fullStr Breast cancer diagnosis through knowledge distillation of Swin transformer-based teacher–student models
title_full_unstemmed Breast cancer diagnosis through knowledge distillation of Swin transformer-based teacher–student models
title_short Breast cancer diagnosis through knowledge distillation of Swin transformer-based teacher–student models
title_sort breast cancer diagnosis through knowledge distillation of swin transformer based teacher student models
topic teacher model
student model
Swin-transformers
transfer learning
knowledge distillation
breast cancer histopathology
url https://doi.org/10.1088/2632-2153/ad10cc
work_keys_str_mv AT bhavannarayannakolla breastcancerdiagnosisthroughknowledgedistillationofswintransformerbasedteacherstudentmodels
AT venugopalp breastcancerdiagnosisthroughknowledgedistillationofswintransformerbasedteacherstudentmodels