Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer

The effective segmentation of esophagus and esophageal cancer from Computed Tomography (CT) images can meaningfully assist doctors in the diagnosis and treatment of esophageal cancer patients. However, problems such as the small proportion of the esophageal region in CT images and the irregular shap...

Full description

Bibliographic Details
Main Authors: Guoheng Huang, Junwen Zhu, Jiajian Li, Zhuowei Wang, Lianglun Cheng, Lizhi Liu, Haojiang Li, Jian Zhou
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9134858/
_version_ 1818875406917304320
author Guoheng Huang
Junwen Zhu
Jiajian Li
Zhuowei Wang
Lianglun Cheng
Lizhi Liu
Haojiang Li
Jian Zhou
author_facet Guoheng Huang
Junwen Zhu
Jiajian Li
Zhuowei Wang
Lianglun Cheng
Lizhi Liu
Haojiang Li
Jian Zhou
author_sort Guoheng Huang
collection DOAJ
description The effective segmentation of esophagus and esophageal cancer from Computed Tomography (CT) images can meaningfully assist doctors in the diagnosis and treatment of esophageal cancer patients. However, problems such as the small proportion of the esophageal region in CT images and the irregular shape of the esophagus will make the diagnosis difficult. In practical applications, not all esophagus and esophageal cancer morphology can be included in the training set, so the generalization ability of the model is very important. These are the difficulties in segmenting the esophagus and esophageal cancer. Since some adjacent tissues and organs of the esophagus are visually close to the esophagus and esophageal cancer, how to ensure that the network can extract effective distinguishing features has become the focus of research. In this paper, a novel U-Net structure - Channel-attention U-Net is proposed to segment esophagus and esophageal cancer from CT slices. This novel network combines a Channel Attention Module (CAM) that can distinguish the esophagus and surrounding tissues by emphasizing and inhibiting channel feature and Cross-level Feature Fusion Module (CFFM) which is utilized to strengthen the generalization ability of the network by using high-level features to weight low-level features. Because the high-level features represent specific organizational information, and the low-level features represent the characteristics of detailed information such as edges and contours, the network can learn specific detailed features of a definite organization. In addition, to locate the esophageal region better, a 3D semi-automatic method for segmenting esophagus and esophageal cancer is proposed. The proposed network is trained using 46,400 CT pictures as the training set and divides 11,600 CT images from the dataset at a ratio of 0.2 as the validation set. Finally, 7,250 CT images were used as the test set to test the performance of the network. The experimental results show that the IoU value of our network can reach 0.625, the dice value is 0.732 and the Hausdorff distance is 3.193.
first_indexed 2024-12-19T13:26:00Z
format Article
id doaj.art-8b466ad79fad4d5c8082ba0cb360e41b
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-19T13:26:00Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-8b466ad79fad4d5c8082ba0cb360e41b2022-12-21T20:19:32ZengIEEEIEEE Access2169-35362020-01-01812279812281010.1109/ACCESS.2020.30077199134858Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal CancerGuoheng Huang0https://orcid.org/0000-0002-3640-3229Junwen Zhu1Jiajian Li2Zhuowei Wang3https://orcid.org/0000-0001-6479-5154Lianglun Cheng4Lizhi Liu5Haojiang Li6Jian Zhou7https://orcid.org/0000-0002-6868-9866School of Computers, Guangdong University of Technology, Guangzhou, ChinaSchool of Computers, Guangdong University of Technology, Guangzhou, ChinaSchool of Computers, Guangdong University of Technology, Guangzhou, ChinaSchool of Computers, Guangdong University of Technology, Guangzhou, ChinaSchool of Computers, Guangdong University of Technology, Guangzhou, ChinaDepartment of Medical Imaging, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, ChinaDepartment of Medical Imaging, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, ChinaDepartment of Medical Imaging, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, ChinaThe effective segmentation of esophagus and esophageal cancer from Computed Tomography (CT) images can meaningfully assist doctors in the diagnosis and treatment of esophageal cancer patients. However, problems such as the small proportion of the esophageal region in CT images and the irregular shape of the esophagus will make the diagnosis difficult. In practical applications, not all esophagus and esophageal cancer morphology can be included in the training set, so the generalization ability of the model is very important. These are the difficulties in segmenting the esophagus and esophageal cancer. Since some adjacent tissues and organs of the esophagus are visually close to the esophagus and esophageal cancer, how to ensure that the network can extract effective distinguishing features has become the focus of research. In this paper, a novel U-Net structure - Channel-attention U-Net is proposed to segment esophagus and esophageal cancer from CT slices. This novel network combines a Channel Attention Module (CAM) that can distinguish the esophagus and surrounding tissues by emphasizing and inhibiting channel feature and Cross-level Feature Fusion Module (CFFM) which is utilized to strengthen the generalization ability of the network by using high-level features to weight low-level features. Because the high-level features represent specific organizational information, and the low-level features represent the characteristics of detailed information such as edges and contours, the network can learn specific detailed features of a definite organization. In addition, to locate the esophageal region better, a 3D semi-automatic method for segmenting esophagus and esophageal cancer is proposed. The proposed network is trained using 46,400 CT pictures as the training set and divides 11,600 CT images from the dataset at a ratio of 0.2 as the validation set. Finally, 7,250 CT images were used as the test set to test the performance of the network. The experimental results show that the IoU value of our network can reach 0.625, the dice value is 0.732 and the Hausdorff distance is 3.193.https://ieeexplore.ieee.org/document/9134858/Esophageal cancerchannel attention mechanismdeep learningcomputed tomography (CT)
spellingShingle Guoheng Huang
Junwen Zhu
Jiajian Li
Zhuowei Wang
Lianglun Cheng
Lizhi Liu
Haojiang Li
Jian Zhou
Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer
IEEE Access
Esophageal cancer
channel attention mechanism
deep learning
computed tomography (CT)
title Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer
title_full Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer
title_fullStr Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer
title_full_unstemmed Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer
title_short Channel-Attention U-Net: Channel Attention Mechanism for Semantic Segmentation of Esophagus and Esophageal Cancer
title_sort channel attention u net channel attention mechanism for semantic segmentation of esophagus and esophageal cancer
topic Esophageal cancer
channel attention mechanism
deep learning
computed tomography (CT)
url https://ieeexplore.ieee.org/document/9134858/
work_keys_str_mv AT guohenghuang channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer
AT junwenzhu channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer
AT jiajianli channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer
AT zhuoweiwang channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer
AT liangluncheng channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer
AT lizhiliu channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer
AT haojiangli channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer
AT jianzhou channelattentionunetchannelattentionmechanismforsemanticsegmentationofesophagusandesophagealcancer