Real-Time Infrared Small Target Detection With Nonlocal Spatial-Temporal Feature Fusion

Infrared small target detection is a challenging task in which many researchers have made lots of achievements. While the performance of single-frame (SF) detection is still limited due to the lack of usage of multiframe (MF) continuous information, many spatial-temporal detection methods have been...

Full description

Bibliographic Details
Main Authors: Hai Xu, Sheng Zhong, Tianxu Zhang, Xu Zou
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10480567/
_version_ 1797201323614011392
author Hai Xu
Sheng Zhong
Tianxu Zhang
Xu Zou
author_facet Hai Xu
Sheng Zhong
Tianxu Zhang
Xu Zou
author_sort Hai Xu
collection DOAJ
description Infrared small target detection is a challenging task in which many researchers have made lots of achievements. While the performance of single-frame (SF) detection is still limited due to the lack of usage of multiframe (MF) continuous information, many spatial-temporal detection methods have been developed. However, most algorithms need to register the image or feed a set of images as the input. Inputting a batch of group images usually leads to large computations, which heavily affects their real-time processing capability in resource-limited machines. To tackle the problem, we propose a nonlocal multiframe network (NLMF-Net) with only a few additional computations (no more than 0.01 GFLOPs) compared to the SF baseline while achieving significant performance improvements. The NLMF-Net correlates features from grid cells with high confidence between current and past frames. While most background grid cells are removed after the SF processing, the MF feature fusion only focuses on a few potential target grid cells, resulting in high computation efficiency. The proposed vector length similarity module enlarges the difference between different grid cells and the non max similarity suppression further suppresses the backgrounds during the fusion, promoting the MF performance. The NLMF-Net can be readily deployed on Jetson Nano at a speed of 20 FPS for 288 × 384 image processing or Mi Pad 2 with a speed over 35 FPS for 128 × 128 part image processing. Extensive experiments show that our proposed method achieves state-of-the-art performance on three datasets while maintaining high efficiency in a real-time processing manner.
first_indexed 2024-04-24T07:45:43Z
format Article
id doaj.art-66d66ace9ba648c892e1a97bdc685095
institution Directory Open Access Journal
issn 1939-1404
2151-1535
language English
last_indexed 2024-04-24T07:45:43Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj.art-66d66ace9ba648c892e1a97bdc6850952024-04-18T23:00:17ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing1939-14042151-15352024-01-01177888790210.1109/JSTARS.2024.338238910480567Real-Time Infrared Small Target Detection With Nonlocal Spatial-Temporal Feature FusionHai Xu0https://orcid.org/0000-0002-3674-180XSheng Zhong1https://orcid.org/0000-0003-2865-8202Tianxu Zhang2https://orcid.org/0009-0003-1071-4146Xu Zou3https://orcid.org/0000-0002-0251-7404National Key Laboratory of Multispectral Information Intelligent Processing Technology/ School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, ChinaNational Key Laboratory of Multispectral Information Intelligent Processing Technology/ School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, ChinaNational Key Laboratory of Multispectral Information Intelligent Processing Technology/ School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, ChinaNational Key Laboratory of Multispectral Information Intelligent Processing Technology/ School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, ChinaInfrared small target detection is a challenging task in which many researchers have made lots of achievements. While the performance of single-frame (SF) detection is still limited due to the lack of usage of multiframe (MF) continuous information, many spatial-temporal detection methods have been developed. However, most algorithms need to register the image or feed a set of images as the input. Inputting a batch of group images usually leads to large computations, which heavily affects their real-time processing capability in resource-limited machines. To tackle the problem, we propose a nonlocal multiframe network (NLMF-Net) with only a few additional computations (no more than 0.01 GFLOPs) compared to the SF baseline while achieving significant performance improvements. The NLMF-Net correlates features from grid cells with high confidence between current and past frames. While most background grid cells are removed after the SF processing, the MF feature fusion only focuses on a few potential target grid cells, resulting in high computation efficiency. The proposed vector length similarity module enlarges the difference between different grid cells and the non max similarity suppression further suppresses the backgrounds during the fusion, promoting the MF performance. The NLMF-Net can be readily deployed on Jetson Nano at a speed of 20 FPS for 288 × 384 image processing or Mi Pad 2 with a speed over 35 FPS for 128 × 128 part image processing. Extensive experiments show that our proposed method achieves state-of-the-art performance on three datasets while maintaining high efficiency in a real-time processing manner.https://ieeexplore.ieee.org/document/10480567/Convolution neural network (CNN)infrared small target (IST) detectionreal-time processingspatial-temporal (ST) fusion
spellingShingle Hai Xu
Sheng Zhong
Tianxu Zhang
Xu Zou
Real-Time Infrared Small Target Detection With Nonlocal Spatial-Temporal Feature Fusion
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Convolution neural network (CNN)
infrared small target (IST) detection
real-time processing
spatial-temporal (ST) fusion
title Real-Time Infrared Small Target Detection With Nonlocal Spatial-Temporal Feature Fusion
title_full Real-Time Infrared Small Target Detection With Nonlocal Spatial-Temporal Feature Fusion
title_fullStr Real-Time Infrared Small Target Detection With Nonlocal Spatial-Temporal Feature Fusion
title_full_unstemmed Real-Time Infrared Small Target Detection With Nonlocal Spatial-Temporal Feature Fusion
title_short Real-Time Infrared Small Target Detection With Nonlocal Spatial-Temporal Feature Fusion
title_sort real time infrared small target detection with nonlocal spatial temporal feature fusion
topic Convolution neural network (CNN)
infrared small target (IST) detection
real-time processing
spatial-temporal (ST) fusion
url https://ieeexplore.ieee.org/document/10480567/
work_keys_str_mv AT haixu realtimeinfraredsmalltargetdetectionwithnonlocalspatialtemporalfeaturefusion
AT shengzhong realtimeinfraredsmalltargetdetectionwithnonlocalspatialtemporalfeaturefusion
AT tianxuzhang realtimeinfraredsmalltargetdetectionwithnonlocalspatialtemporalfeaturefusion
AT xuzou realtimeinfraredsmalltargetdetectionwithnonlocalspatialtemporalfeaturefusion