Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer
The effective segmentation of esophagus and esophageal cancer from Computed Tomography (CT) images can meaningfully assist doctors in the diagnosis and treatment of esophageal cancer patients. However, problems such as the small proportion of the esophageal region in CT images and the irregular shap...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9134858/ |
_version_ | 1818875406917304320 |
---|---|
author | Guoheng Huang Junwen Zhu Jiajian Li Zhuowei Wang Lianglun Cheng Lizhi Liu Haojiang Li Jian Zhou |
author_facet | Guoheng Huang Junwen Zhu Jiajian Li Zhuowei Wang Lianglun Cheng Lizhi Liu Haojiang Li Jian Zhou |
author_sort | Guoheng Huang |
collection | DOAJ |
description | The effective segmentation of esophagus and esophageal cancer from Computed Tomography (CT) images can meaningfully assist doctors in the diagnosis and treatment of esophageal cancer patients. However, problems such as the small proportion of the esophageal region in CT images and the irregular shape of the esophagus will make the diagnosis difficult. In practical applications, not all esophagus and esophageal cancer morphology can be included in the training set, so the generalization ability of the model is very important. These are the difficulties in segmenting the esophagus and esophageal cancer. Since some adjacent tissues and organs of the esophagus are visually close to the esophagus and esophageal cancer, how to ensure that the network can extract effective distinguishing features has become the focus of research. In this paper, a novel U-Net structure - Channel-attention U-Net is proposed to segment esophagus and esophageal cancer from CT slices. This novel network combines a Channel Attention Module (CAM) that can distinguish the esophagus and surrounding tissues by emphasizing and inhibiting channel feature and Cross-level Feature Fusion Module (CFFM) which is utilized to strengthen the generalization ability of the network by using high-level features to weight low-level features. Because the high-level features represent specific organizational information, and the low-level features represent the characteristics of detailed information such as edges and contours, the network can learn specific detailed features of a definite organization. In addition, to locate the esophageal region better, a 3D semi-automatic method for segmenting esophagus and esophageal cancer is proposed. The proposed network is trained using 46,400 CT pictures as the training set and divides 11,600 CT images from the dataset at a ratio of 0.2 as the validation set. Finally, 7,250 CT images were used as the test set to test the performance of the network. The experimental results show that the IoU value of our network can reach 0.625, the dice value is 0.732 and the Hausdorff distance is 3.193. |
first_indexed | 2024-12-19T13:26:00Z |
format | Article |
id | doaj.art-8b466ad79fad4d5c8082ba0cb360e41b |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-19T13:26:00Z |
publishDate | 2020-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-8b466ad79fad4d5c8082ba0cb360e41b2022-12-21T20:19:32ZengIEEEIEEE Access2169-35362020-01-01812279812281010.1109/ACCESS.2020.30077199134858Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal CancerGuoheng Huang0https://orcid.org/0000-0002-3640-3229Junwen Zhu1Jiajian Li2Zhuowei Wang3https://orcid.org/0000-0001-6479-5154Lianglun Cheng4Lizhi Liu5Haojiang Li6Jian Zhou7https://orcid.org/0000-0002-6868-9866School of Computers, Guangdong University of Technology, Guangzhou, ChinaSchool of Computers, Guangdong University of Technology, Guangzhou, ChinaSchool of Computers, Guangdong University of Technology, Guangzhou, ChinaSchool of Computers, Guangdong University of Technology, Guangzhou, ChinaSchool of Computers, Guangdong University of Technology, Guangzhou, ChinaDepartment of Medical Imaging, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, ChinaDepartment of Medical Imaging, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, ChinaDepartment of Medical Imaging, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, ChinaThe effective segmentation of esophagus and esophageal cancer from Computed Tomography (CT) images can meaningfully assist doctors in the diagnosis and treatment of esophageal cancer patients. However, problems such as the small proportion of the esophageal region in CT images and the irregular shape of the esophagus will make the diagnosis difficult. In practical applications, not all esophagus and esophageal cancer morphology can be included in the training set, so the generalization ability of the model is very important. These are the difficulties in segmenting the esophagus and esophageal cancer. Since some adjacent tissues and organs of the esophagus are visually close to the esophagus and esophageal cancer, how to ensure that the network can extract effective distinguishing features has become the focus of research. In this paper, a novel U-Net structure - Channel-attention U-Net is proposed to segment esophagus and esophageal cancer from CT slices. This novel network combines a Channel Attention Module (CAM) that can distinguish the esophagus and surrounding tissues by emphasizing and inhibiting channel feature and Cross-level Feature Fusion Module (CFFM) which is utilized to strengthen the generalization ability of the network by using high-level features to weight low-level features. Because the high-level features represent specific organizational information, and the low-level features represent the characteristics of detailed information such as edges and contours, the network can learn specific detailed features of a definite organization. In addition, to locate the esophageal region better, a 3D semi-automatic method for segmenting esophagus and esophageal cancer is proposed. The proposed network is trained using 46,400 CT pictures as the training set and divides 11,600 CT images from the dataset at a ratio of 0.2 as the validation set. Finally, 7,250 CT images were used as the test set to test the performance of the network. The experimental results show that the IoU value of our network can reach 0.625, the dice value is 0.732 and the Hausdorff distance is 3.193.https://ieeexplore.ieee.org/document/9134858/Esophageal cancerchannel attention mechanismdeep learningcomputed tomography (CT) |
spellingShingle | Guoheng Huang Junwen Zhu Jiajian Li Zhuowei Wang Lianglun Cheng Lizhi Liu Haojiang Li Jian Zhou Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer IEEE Access Esophageal cancer channel attention mechanism deep learning computed tomography (CT) |
title | Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer |
title_full | Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer |
title_fullStr | Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer |
title_full_unstemmed | Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer |
title_short | Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer |
title_sort | channel attention u net channel attention mechanism for semantic segmentation of esophagus and esophageal cancer |
topic | Esophageal cancer channel attention mechanism deep learning computed tomography (CT) |
url | https://ieeexplore.ieee.org/document/9134858/ |
work_keys_str_mv | AT guohenghuang channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer AT junwenzhu channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer AT jiajianli channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer AT zhuoweiwang channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer AT liangluncheng channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer AT lizhiliu channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer AT haojiangli channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer AT jianzhou channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer |