CEFusion: Multi‐Modal medical image fusion via cross encoder

Abstract Most existing deep learning‐based multi‐modal medical image fusion (MMIF) methods utilize single‐branch feature extraction strategies to achieve good fusion performance. However, for MMIF tasks, it is thought that this structure cuts off the internal connections between source images, resul...

Full description

Bibliographic Details
Main Authors:	Ya Zhu, Xue Wang, Luping Chen, Rencan Nie
Format:	Article
Language:	English
Published:	Wiley 2022-10-01
Series:	IET Image Processing
Online Access:	https://doi.org/10.1049/ipr2.12549

_version_	1811337562655031296
author	Ya Zhu Xue Wang Luping Chen Rencan Nie
author_facet	Ya Zhu Xue Wang Luping Chen Rencan Nie
author_sort	Ya Zhu
collection	DOAJ
description	Abstract Most existing deep learning‐based multi‐modal medical image fusion (MMIF) methods utilize single‐branch feature extraction strategies to achieve good fusion performance. However, for MMIF tasks, it is thought that this structure cuts off the internal connections between source images, resulting in information redundancy and degradation of fusion performance. To this end, this paper proposes a novel unsupervised network, termed CEFusion. Different from existing architecture, a cross‐encoder is designed by exploiting the complementary properties between the original image to refine source features through feature interaction and reuse. Furthermore, to force the network to learn complementary information between source images and generate the fused image with high contrast and rich textures, a hybrid loss is proposed consisting of weighted fidelity and gradient losses. Specifically, the weighted fidelity loss can not only force the fusion results to approximate the source images but also effectively preserve the luminance information of the source image through weight estimation, while the gradient loss preserves the texture information of the source image. Experimental results demonstrate the superiority of the method over the state‐of‐the‐art in terms of subjective visual effect and quantitative metrics in various datasets.
first_indexed	2024-04-13T17:56:48Z
format	Article
id	doaj.art-28aada4eb696429c9a9262a573783cd3
institution	Directory Open Access Journal
issn	1751-9659 1751-9667
language	English
last_indexed	2024-04-13T17:56:48Z
publishDate	2022-10-01
publisher	Wiley
record_format	Article
series	IET Image Processing
spelling	doaj.art-28aada4eb696429c9a9262a573783cd32022-12-22T02:36:27ZengWileyIET Image Processing1751-96591751-96672022-10-0116123177318910.1049/ipr2.12549CEFusion: Multi‐Modal medical image fusion via cross encoderYa Zhu0Xue Wang1Luping Chen2Rencan Nie3School of Information Science and Engineering Yunnan University Kunming 650500 ChinaSchool of Information Science and Engineering Yunnan University Kunming 650500 ChinaSchool of Information Science and Engineering Yunnan University Kunming 650500 ChinaSchool of Information Science and Engineering Yunnan University Kunming 650500 ChinaAbstract Most existing deep learning‐based multi‐modal medical image fusion (MMIF) methods utilize single‐branch feature extraction strategies to achieve good fusion performance. However, for MMIF tasks, it is thought that this structure cuts off the internal connections between source images, resulting in information redundancy and degradation of fusion performance. To this end, this paper proposes a novel unsupervised network, termed CEFusion. Different from existing architecture, a cross‐encoder is designed by exploiting the complementary properties between the original image to refine source features through feature interaction and reuse. Furthermore, to force the network to learn complementary information between source images and generate the fused image with high contrast and rich textures, a hybrid loss is proposed consisting of weighted fidelity and gradient losses. Specifically, the weighted fidelity loss can not only force the fusion results to approximate the source images but also effectively preserve the luminance information of the source image through weight estimation, while the gradient loss preserves the texture information of the source image. Experimental results demonstrate the superiority of the method over the state‐of‐the‐art in terms of subjective visual effect and quantitative metrics in various datasets.https://doi.org/10.1049/ipr2.12549
spellingShingle	Ya Zhu Xue Wang Luping Chen Rencan Nie CEFusion: Multi‐Modal medical image fusion via cross encoder IET Image Processing
title	CEFusion: Multi‐Modal medical image fusion via cross encoder
title_full	CEFusion: Multi‐Modal medical image fusion via cross encoder
title_fullStr	CEFusion: Multi‐Modal medical image fusion via cross encoder
title_full_unstemmed	CEFusion: Multi‐Modal medical image fusion via cross encoder
title_short	CEFusion: Multi‐Modal medical image fusion via cross encoder
title_sort	cefusion multi modal medical image fusion via cross encoder
url	https://doi.org/10.1049/ipr2.12549
work_keys_str_mv	AT yazhu cefusionmultimodalmedicalimagefusionviacrossencoder AT xuewang cefusionmultimodalmedicalimagefusionviacrossencoder AT lupingchen cefusionmultimodalmedicalimagefusionviacrossencoder AT rencannie cefusionmultimodalmedicalimagefusionviacrossencoder

CEFusion: Multi‐Modal medical image fusion via cross encoder

Similar Items