AFRE-Net: Adaptive Feature Representation Enhancement for Arbitrary Oriented Object Detection

Arbitrary-oriented object detection (AOOD) is a crucial task in aerial image analysis but is also faced with significant challenges. In current AOOD detectors, commonly used multi-scale feature fusion modules fall short in spatial and semantic information complement between scales. Additionally, fix...

Full description

Bibliographic Details
Main Authors: Tianwei Zhang, Xu Sun, Lina Zhuang, Xiaoyu Dong, Jianjun Sha, Bing Zhang, Ke Zheng
Format: Article
Language:English
Published: MDPI AG 2023-10-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/15/20/4965
_version_ 1797572377577521152
author Tianwei Zhang
Xu Sun
Lina Zhuang
Xiaoyu Dong
Jianjun Sha
Bing Zhang
Ke Zheng
author_facet Tianwei Zhang
Xu Sun
Lina Zhuang
Xiaoyu Dong
Jianjun Sha
Bing Zhang
Ke Zheng
author_sort Tianwei Zhang
collection DOAJ
description Arbitrary-oriented object detection (AOOD) is a crucial task in aerial image analysis but is also faced with significant challenges. In current AOOD detectors, commonly used multi-scale feature fusion modules fall short in spatial and semantic information complement between scales. Additionally, fixed feature extraction structures are usually used following a fusion model, resulting in the inability of detectors to self-adjust. At the same time, feature fusion and extraction modules are designed in isolation and the internal synergy between them is ignored. The above problems result in feature representation deficiency, thus affecting the overall detection precision. To solve these problems, we first create a fine-grained feature pyramid network (FG-FPN) that not only provides richer spatial and semantic features, but also completes neighbor scale features in a self-learning mode. Subsequently, we propose a novel feature enhancement module (FEM) to fit FG-FPN. FEM authorizes the detection unit to automatically adjust the sensing area and adaptively suppress background interference, thereby generating stronger feature representations. Our proposed solution was tested through extensive experiments on challenging datasets, including DOTA (77.44% mAP), HRSC2016 (97.82% mAP), UCAS-AOD (91.34% mAP), as well as ICDAR2015 (86.27% F-score) and its effectiveness and high applicability are verified on all the above datasets.
first_indexed 2024-03-10T20:55:25Z
format Article
id doaj.art-293a1ce101f64c5a89d57c6603c2494d
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-10T20:55:25Z
publishDate 2023-10-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-293a1ce101f64c5a89d57c6603c2494d2023-11-19T17:58:56ZengMDPI AGRemote Sensing2072-42922023-10-011520496510.3390/rs15204965AFRE-Net: Adaptive Feature Representation Enhancement for Arbitrary Oriented Object DetectionTianwei Zhang0Xu Sun1Lina Zhuang2Xiaoyu Dong3Jianjun Sha4Bing Zhang5Ke Zheng6Key Laboratory of Computational Optical Imaging Technology, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaKey Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaKey Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, ChinaDepartment of Complexity Science and Engineering, The University of Tokyo, Tokyo 277-8561, JapanCollege of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, ChinaCollege of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, ChinaCollege of Geography and Environment, Liaocheng University, Liaocheng 252059, ChinaArbitrary-oriented object detection (AOOD) is a crucial task in aerial image analysis but is also faced with significant challenges. In current AOOD detectors, commonly used multi-scale feature fusion modules fall short in spatial and semantic information complement between scales. Additionally, fixed feature extraction structures are usually used following a fusion model, resulting in the inability of detectors to self-adjust. At the same time, feature fusion and extraction modules are designed in isolation and the internal synergy between them is ignored. The above problems result in feature representation deficiency, thus affecting the overall detection precision. To solve these problems, we first create a fine-grained feature pyramid network (FG-FPN) that not only provides richer spatial and semantic features, but also completes neighbor scale features in a self-learning mode. Subsequently, we propose a novel feature enhancement module (FEM) to fit FG-FPN. FEM authorizes the detection unit to automatically adjust the sensing area and adaptively suppress background interference, thereby generating stronger feature representations. Our proposed solution was tested through extensive experiments on challenging datasets, including DOTA (77.44% mAP), HRSC2016 (97.82% mAP), UCAS-AOD (91.34% mAP), as well as ICDAR2015 (86.27% F-score) and its effectiveness and high applicability are verified on all the above datasets.https://www.mdpi.com/2072-4292/15/20/4965deep learningobject detectionremote sensingfeature representation enhancement
spellingShingle Tianwei Zhang
Xu Sun
Lina Zhuang
Xiaoyu Dong
Jianjun Sha
Bing Zhang
Ke Zheng
AFRE-Net: Adaptive Feature Representation Enhancement for Arbitrary Oriented Object Detection
Remote Sensing
deep learning
object detection
remote sensing
feature representation enhancement
title AFRE-Net: Adaptive Feature Representation Enhancement for Arbitrary Oriented Object Detection
title_full AFRE-Net: Adaptive Feature Representation Enhancement for Arbitrary Oriented Object Detection
title_fullStr AFRE-Net: Adaptive Feature Representation Enhancement for Arbitrary Oriented Object Detection
title_full_unstemmed AFRE-Net: Adaptive Feature Representation Enhancement for Arbitrary Oriented Object Detection
title_short AFRE-Net: Adaptive Feature Representation Enhancement for Arbitrary Oriented Object Detection
title_sort afre net adaptive feature representation enhancement for arbitrary oriented object detection
topic deep learning
object detection
remote sensing
feature representation enhancement
url https://www.mdpi.com/2072-4292/15/20/4965
work_keys_str_mv AT tianweizhang afrenetadaptivefeaturerepresentationenhancementforarbitraryorientedobjectdetection
AT xusun afrenetadaptivefeaturerepresentationenhancementforarbitraryorientedobjectdetection
AT linazhuang afrenetadaptivefeaturerepresentationenhancementforarbitraryorientedobjectdetection
AT xiaoyudong afrenetadaptivefeaturerepresentationenhancementforarbitraryorientedobjectdetection
AT jianjunsha afrenetadaptivefeaturerepresentationenhancementforarbitraryorientedobjectdetection
AT bingzhang afrenetadaptivefeaturerepresentationenhancementforarbitraryorientedobjectdetection
AT kezheng afrenetadaptivefeaturerepresentationenhancementforarbitraryorientedobjectdetection