SS-TMNet: Spatial–Spectral Transformer Network with Multi-Scale Convolution for Hyperspectral Image Classification

Hyperspectral image (HSI) classification is a significant foundation for remote sensing image analysis, widely used in biology, aerospace, and other applications. Convolution neural networks (CNNs) and attention mechanisms have shown outstanding ability in HSI classification and have been widely stu...

Full description

Bibliographic Details
Main Authors:	Xiaohui Huang, Yunfei Zhou, Xiaofei Yang, Xianhong Zhu, Ke Wang
Format:	Article
Language:	English
Published:	MDPI AG 2023-02-01
Series:	Remote Sensing
Subjects:	multi-scale 3D convolution convolution neural network (CNN) attention mechanism hyperspectral image (HSI) classification
Online Access:	https://www.mdpi.com/2072-4292/15/5/1206

_version_	1797614419835879424
author	Xiaohui Huang Yunfei Zhou Xiaofei Yang Xianhong Zhu Ke Wang
author_facet	Xiaohui Huang Yunfei Zhou Xiaofei Yang Xianhong Zhu Ke Wang
author_sort	Xiaohui Huang
collection	DOAJ
description	Hyperspectral image (HSI) classification is a significant foundation for remote sensing image analysis, widely used in biology, aerospace, and other applications. Convolution neural networks (CNNs) and attention mechanisms have shown outstanding ability in HSI classification and have been widely studied in recent years. However, the existing CNN-based and attention mechanism-based methods cannot fully use spatial–spectral information, which is not conducive to further improving HSI classification accuracy. This paper proposes a new spatial–spectral Transformer network with multi-scale convolution (SS-TMNet), which can effectively extract local and global spatial–spectral information. SS-TMNet includes two key modules, i.e., multi-scale 3D convolution projection module (MSCP) and spatial–spectral attention module (SSAM). The MSCP uses multi-scale 3D convolutions with different depths to extract the fused spatial–spectral features. The spatial–spectral attention module includes three branches: height spatial attention, width spatial attention, and spectral attention, which can extract the fusion information of spatial and spectral features. The proposed SS-TMNet was tested on three widely used HSI datasets: Pavia University, IndianPines, and Houston2013. The experimental results show that the proposed SS-TMNet is superior to the existing methods.
first_indexed	2024-03-11T07:12:08Z
format	Article
id	doaj.art-1e601a081cd2490e941aea1a85021057
institution	Directory Open Access Journal
issn	2072-4292
language	English
last_indexed	2024-03-11T07:12:08Z
publishDate	2023-02-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj.art-1e601a081cd2490e941aea1a850210572023-11-17T08:29:51ZengMDPI AGRemote Sensing2072-42922023-02-01155120610.3390/rs15051206SS-TMNet: Spatial–Spectral Transformer Network with Multi-Scale Convolution for Hyperspectral Image ClassificationXiaohui Huang0Yunfei Zhou1Xiaofei Yang2Xianhong Zhu3Ke Wang4School of Information Engineering, East China Jiaotong University, Nanchang 330013, ChinaSchool of Information Engineering, East China Jiaotong University, Nanchang 330013, ChinaThe Department of Computer and Information Science, University of Macau, Macau 519000, ChinaSchool of Information Engineering, East China Jiaotong University, Nanchang 330013, ChinaSchool of Computer Science, Shenyang Aerospace University, Shenyang 110136, ChinaHyperspectral image (HSI) classification is a significant foundation for remote sensing image analysis, widely used in biology, aerospace, and other applications. Convolution neural networks (CNNs) and attention mechanisms have shown outstanding ability in HSI classification and have been widely studied in recent years. However, the existing CNN-based and attention mechanism-based methods cannot fully use spatial–spectral information, which is not conducive to further improving HSI classification accuracy. This paper proposes a new spatial–spectral Transformer network with multi-scale convolution (SS-TMNet), which can effectively extract local and global spatial–spectral information. SS-TMNet includes two key modules, i.e., multi-scale 3D convolution projection module (MSCP) and spatial–spectral attention module (SSAM). The MSCP uses multi-scale 3D convolutions with different depths to extract the fused spatial–spectral features. The spatial–spectral attention module includes three branches: height spatial attention, width spatial attention, and spectral attention, which can extract the fusion information of spatial and spectral features. The proposed SS-TMNet was tested on three widely used HSI datasets: Pavia University, IndianPines, and Houston2013. The experimental results show that the proposed SS-TMNet is superior to the existing methods.https://www.mdpi.com/2072-4292/15/5/1206multi-scale 3D convolutionconvolution neural network (CNN)attention mechanismhyperspectral image (HSI) classification
spellingShingle	Xiaohui Huang Yunfei Zhou Xiaofei Yang Xianhong Zhu Ke Wang SS-TMNet: Spatial–Spectral Transformer Network with Multi-Scale Convolution for Hyperspectral Image Classification Remote Sensing multi-scale 3D convolution convolution neural network (CNN) attention mechanism hyperspectral image (HSI) classification
title	SS-TMNet: Spatial–Spectral Transformer Network with Multi-Scale Convolution for Hyperspectral Image Classification
title_full	SS-TMNet: Spatial–Spectral Transformer Network with Multi-Scale Convolution for Hyperspectral Image Classification
title_fullStr	SS-TMNet: Spatial–Spectral Transformer Network with Multi-Scale Convolution for Hyperspectral Image Classification
title_full_unstemmed	SS-TMNet: Spatial–Spectral Transformer Network with Multi-Scale Convolution for Hyperspectral Image Classification
title_short	SS-TMNet: Spatial–Spectral Transformer Network with Multi-Scale Convolution for Hyperspectral Image Classification
title_sort	ss tmnet spatial spectral transformer network with multi scale convolution for hyperspectral image classification
topic	multi-scale 3D convolution convolution neural network (CNN) attention mechanism hyperspectral image (HSI) classification
url	https://www.mdpi.com/2072-4292/15/5/1206
work_keys_str_mv	AT xiaohuihuang sstmnetspatialspectraltransformernetworkwithmultiscaleconvolutionforhyperspectralimageclassification AT yunfeizhou sstmnetspatialspectraltransformernetworkwithmultiscaleconvolutionforhyperspectralimageclassification AT xiaofeiyang sstmnetspatialspectraltransformernetworkwithmultiscaleconvolutionforhyperspectralimageclassification AT xianhongzhu sstmnetspatialspectraltransformernetworkwithmultiscaleconvolutionforhyperspectralimageclassification AT kewang sstmnetspatialspectraltransformernetworkwithmultiscaleconvolutionforhyperspectralimageclassification

SS-TMNet: Spatial–Spectral Transformer Network with Multi-Scale Convolution for Hyperspectral Image Classification

Similar Items