MashFormer: A Novel Multiscale Aware Hybrid Detector for Remote Sensing Object Detection

Object detection is a critical and demanding topic in the subject of processing satellite and airborne images. The targets acquired in remote sensing imagery are at various sizes, and the backgrounds are complicated, which makes object detection extremely challenging. We address these aforementioned...

Full description

Bibliographic Details
Main Authors: Keyan Wang, Feiyu Bai, Jiaojiao Li, Yajing Liu, Yunsong Li
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10064169/
_version_ 1797859061482389504
author Keyan Wang
Feiyu Bai
Jiaojiao Li
Yajing Liu
Yunsong Li
author_facet Keyan Wang
Feiyu Bai
Jiaojiao Li
Yajing Liu
Yunsong Li
author_sort Keyan Wang
collection DOAJ
description Object detection is a critical and demanding topic in the subject of processing satellite and airborne images. The targets acquired in remote sensing imagery are at various sizes, and the backgrounds are complicated, which makes object detection extremely challenging. We address these aforementioned issues in this article by introducing the MashFormer, an innovative multiscale aware convolutional neural network (CNN) and transformer integrated hybrid detector. Specifically, MashFormer employs the transformer block to complement the CNN-based feature extraction backbone, which could obtain the relationships between long-range features and enhance the representative ability in complex background scenarios. With the intention of improving the detection performance for objects with multiscale characteristic, since in remote sensing scenarios, the size of object varies greatly. A multilevel feature aggregation component, incorporate with a cross-level feature alignment module is designed to alleviate the semantic discrepancy between features from shallow and deep layers. To verify the effectiveness of the suggested MashFormer, comparative experiments are carried out with other cutting-edge methodologies using the publicly available high resolution remote sensing detection and Northwestern Polytechnical University VHR-10 datasets. The experimental findings confirm the effectiveness and superiority of our suggested model by indicating that our approach has greater mean average precision than the other methodologies.
first_indexed 2024-04-09T21:23:24Z
format Article
id doaj.art-0c9fa8148949498aaaf4eed7557b7346
institution Directory Open Access Journal
issn 2151-1535
language English
last_indexed 2024-04-09T21:23:24Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj.art-0c9fa8148949498aaaf4eed7557b73462023-03-27T23:00:05ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352023-01-01162753276310.1109/JSTARS.2023.325404710064169MashFormer: A Novel Multiscale Aware Hybrid Detector for Remote Sensing Object DetectionKeyan Wang0https://orcid.org/0000-0002-9545-718XFeiyu Bai1https://orcid.org/0009-0002-4180-7954Jiaojiao Li2https://orcid.org/0000-0002-0470-9469Yajing Liu3Yunsong Li4https://orcid.org/0000-0002-0234-6270State Key Laboratory of Integrated Service Networks, School of Telecommunications Engineering, Xidian University, Xi'an, ChinaState Key Laboratory of Integrated Service Networks, School of Telecommunications Engineering, Xidian University, Xi'an, ChinaState Key Laboratory of Integrated Service Networks, School of Telecommunications Engineering, Xidian University, Xi'an, ChinaState Key Laboratory of Integrated Service Networks, School of Telecommunications Engineering, Xidian University, Xi'an, ChinaState Key Laboratory of Integrated Service Networks, School of Telecommunications Engineering, Xidian University, Xi'an, ChinaObject detection is a critical and demanding topic in the subject of processing satellite and airborne images. The targets acquired in remote sensing imagery are at various sizes, and the backgrounds are complicated, which makes object detection extremely challenging. We address these aforementioned issues in this article by introducing the MashFormer, an innovative multiscale aware convolutional neural network (CNN) and transformer integrated hybrid detector. Specifically, MashFormer employs the transformer block to complement the CNN-based feature extraction backbone, which could obtain the relationships between long-range features and enhance the representative ability in complex background scenarios. With the intention of improving the detection performance for objects with multiscale characteristic, since in remote sensing scenarios, the size of object varies greatly. A multilevel feature aggregation component, incorporate with a cross-level feature alignment module is designed to alleviate the semantic discrepancy between features from shallow and deep layers. To verify the effectiveness of the suggested MashFormer, comparative experiments are carried out with other cutting-edge methodologies using the publicly available high resolution remote sensing detection and Northwestern Polytechnical University VHR-10 datasets. The experimental findings confirm the effectiveness and superiority of our suggested model by indicating that our approach has greater mean average precision than the other methodologies.https://ieeexplore.ieee.org/document/10064169/Feature alignmentobject detectionremote sensingtransformer
spellingShingle Keyan Wang
Feiyu Bai
Jiaojiao Li
Yajing Liu
Yunsong Li
MashFormer: A Novel Multiscale Aware Hybrid Detector for Remote Sensing Object Detection
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Feature alignment
object detection
remote sensing
transformer
title MashFormer: A Novel Multiscale Aware Hybrid Detector for Remote Sensing Object Detection
title_full MashFormer: A Novel Multiscale Aware Hybrid Detector for Remote Sensing Object Detection
title_fullStr MashFormer: A Novel Multiscale Aware Hybrid Detector for Remote Sensing Object Detection
title_full_unstemmed MashFormer: A Novel Multiscale Aware Hybrid Detector for Remote Sensing Object Detection
title_short MashFormer: A Novel Multiscale Aware Hybrid Detector for Remote Sensing Object Detection
title_sort mashformer a novel multiscale aware hybrid detector for remote sensing object detection
topic Feature alignment
object detection
remote sensing
transformer
url https://ieeexplore.ieee.org/document/10064169/
work_keys_str_mv AT keyanwang mashformeranovelmultiscaleawarehybriddetectorforremotesensingobjectdetection
AT feiyubai mashformeranovelmultiscaleawarehybriddetectorforremotesensingobjectdetection
AT jiaojiaoli mashformeranovelmultiscaleawarehybriddetectorforremotesensingobjectdetection
AT yajingliu mashformeranovelmultiscaleawarehybriddetectorforremotesensingobjectdetection
AT yunsongli mashformeranovelmultiscaleawarehybriddetectorforremotesensingobjectdetection