Multi-Level Feature Extraction Networks for Hyperspectral Image Classification

Hyperspectral image (HSI) classification plays a key role in the field of earth observation missions. Recently, transformer-based approaches have been widely used for HSI classification due to their ability to model long-range sequences. However, these methods face two main challenges. First, they t...

Full description

Bibliographic Details
Main Authors:	Shaoyi Fang, Xinyu Li, Shimao Tian, Weihao Chen, Erlei Zhang
Format:	Article
Language:	English
Published:	MDPI AG 2024-02-01
Series:	Remote Sensing
Subjects:	hyperspectral image classification convolutional neural networks vision transformer
Online Access:	https://www.mdpi.com/2072-4292/16/3/590

_version_	1797318229004124160
author	Shaoyi Fang Xinyu Li Shimao Tian Weihao Chen Erlei Zhang
author_facet	Shaoyi Fang Xinyu Li Shimao Tian Weihao Chen Erlei Zhang
author_sort	Shaoyi Fang
collection	DOAJ
description	Hyperspectral image (HSI) classification plays a key role in the field of earth observation missions. Recently, transformer-based approaches have been widely used for HSI classification due to their ability to model long-range sequences. However, these methods face two main challenges. First, they treat HSI as linear vectors, disregarding their 3D attributes and spatial structure. Second, the repeated concatenation of encoders leads to information loss and gradient vanishing. To overcome these challenges, we propose a new solution called the multi-level feature extraction network (MLFEN). MLFEN consists of two sub-networks: the hybrid convolutional attention module (HCAM) and the enhanced dense vision transformer (EDVT). HCAM incorporates a band shift strategy to eliminate the edge effect of convolution and utilizes hybrid convolutional blocks to capture the 3D properties and spatial structure of HSI. Additionally, an attention module is introduced to identify strongly discriminative features. EDVT reconfigures the organization of original encoders by incorporating dense connections and adaptive feature fusion components, enabling faster propagation of information and mitigating the problem of gradient vanishing. Furthermore, we propose a novel sparse loss function to better fit the data distribution. Extensive experiments conducted on three public datasets demonstrate the significant advancements achieved by MLFEN.
first_indexed	2024-03-08T03:49:23Z
format	Article
id	doaj.art-031ce165fd654575ac7c443ebac95916
institution	Directory Open Access Journal
issn	2072-4292
language	English
last_indexed	2024-03-08T03:49:23Z
publishDate	2024-02-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj.art-031ce165fd654575ac7c443ebac959162024-02-09T15:21:35ZengMDPI AGRemote Sensing2072-42922024-02-0116359010.3390/rs16030590Multi-Level Feature Extraction Networks for Hyperspectral Image ClassificationShaoyi Fang0Xinyu Li1Shimao Tian2Weihao Chen3Erlei Zhang4School of Information Engineering, Northwest A&F University, Xi’an 712100, ChinaSchool of Information Engineering, Northwest A&F University, Xi’an 712100, ChinaSchool of Information Engineering, Northwest A&F University, Xi’an 712100, ChinaSchool of Information Engineering, Northwest A&F University, Xi’an 712100, ChinaSchool of Information Engineering, Northwest A&F University, Xi’an 712100, ChinaHyperspectral image (HSI) classification plays a key role in the field of earth observation missions. Recently, transformer-based approaches have been widely used for HSI classification due to their ability to model long-range sequences. However, these methods face two main challenges. First, they treat HSI as linear vectors, disregarding their 3D attributes and spatial structure. Second, the repeated concatenation of encoders leads to information loss and gradient vanishing. To overcome these challenges, we propose a new solution called the multi-level feature extraction network (MLFEN). MLFEN consists of two sub-networks: the hybrid convolutional attention module (HCAM) and the enhanced dense vision transformer (EDVT). HCAM incorporates a band shift strategy to eliminate the edge effect of convolution and utilizes hybrid convolutional blocks to capture the 3D properties and spatial structure of HSI. Additionally, an attention module is introduced to identify strongly discriminative features. EDVT reconfigures the organization of original encoders by incorporating dense connections and adaptive feature fusion components, enabling faster propagation of information and mitigating the problem of gradient vanishing. Furthermore, we propose a novel sparse loss function to better fit the data distribution. Extensive experiments conducted on three public datasets demonstrate the significant advancements achieved by MLFEN.https://www.mdpi.com/2072-4292/16/3/590hyperspectral image classificationconvolutional neural networksvision transformer
spellingShingle	Shaoyi Fang Xinyu Li Shimao Tian Weihao Chen Erlei Zhang Multi-Level Feature Extraction Networks for Hyperspectral Image Classification Remote Sensing hyperspectral image classification convolutional neural networks vision transformer
title	Multi-Level Feature Extraction Networks for Hyperspectral Image Classification
title_full	Multi-Level Feature Extraction Networks for Hyperspectral Image Classification
title_fullStr	Multi-Level Feature Extraction Networks for Hyperspectral Image Classification
title_full_unstemmed	Multi-Level Feature Extraction Networks for Hyperspectral Image Classification
title_short	Multi-Level Feature Extraction Networks for Hyperspectral Image Classification
title_sort	multi level feature extraction networks for hyperspectral image classification
topic	hyperspectral image classification convolutional neural networks vision transformer
url	https://www.mdpi.com/2072-4292/16/3/590
work_keys_str_mv	AT shaoyifang multilevelfeatureextractionnetworksforhyperspectralimageclassification AT xinyuli multilevelfeatureextractionnetworksforhyperspectralimageclassification AT shimaotian multilevelfeatureextractionnetworksforhyperspectralimageclassification AT weihaochen multilevelfeatureextractionnetworksforhyperspectralimageclassification AT erleizhang multilevelfeatureextractionnetworksforhyperspectralimageclassification

Multi-Level Feature Extraction Networks for Hyperspectral Image Classification

Similar Items