FSOD4RSI: Few-Shot Object Detection for Remote Sensing Images via Features Aggregation and Scale Attention

Due to the continuous development of few-shot learning, there have been notable advancements in methods for few-shot object detection in recent years. However, most existing methods in this domain primarily focus on natural images, neglecting the challenges posed by variations in object scales, whic...

Full description

Bibliographic Details
Main Authors: Honghao Gao, Shuping Wu, Ye Wang, Jung Yoon Kim, Yueshen Xu
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10423123/
_version_ 1797299848524857344
author Honghao Gao
Shuping Wu
Ye Wang
Jung Yoon Kim
Yueshen Xu
author_facet Honghao Gao
Shuping Wu
Ye Wang
Jung Yoon Kim
Yueshen Xu
author_sort Honghao Gao
collection DOAJ
description Due to the continuous development of few-shot learning, there have been notable advancements in methods for few-shot object detection in recent years. However, most existing methods in this domain primarily focus on natural images, neglecting the challenges posed by variations in object scales, which are usually encountered in remote sensing images. This article proposes a new few-shot object detection model designed to handle the issue of object scale variation in remote sensing images. Our developed model has two essential parts: a feature aggregation module (FAM) and a scale-aware attention module (SAM). Considering the few-shot features of remote sensing images, we designed the FAM to improve the support and query features through channel multiplication operations utilizing a feature pyramid network and a transformer encoder. The created FAM better extracts the global features of remote sensing images and enhances the significant feature representation of few-shot remote sensing objects. In addition, we design the SAM to address the scale variation problems that frequently occur in remote sensing images. By employing multiscale convolutions, the SAM enables the acquisition of contextual features while adapting to objects of varying scales. Extensive experiments were conducted on benchmark datasets, including NWPU VHR-10 and DIOR datasets, and the results show that our model indeed addresses the challenges posed by object scale variation and improves the applicability of few-shot object detection in the remote sensing domain.
first_indexed 2024-03-07T22:57:36Z
format Article
id doaj.art-cb7597c5ee904b5c9b67f86a6efd36c6
institution Directory Open Access Journal
issn 2151-1535
language English
last_indexed 2024-03-07T22:57:36Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj.art-cb7597c5ee904b5c9b67f86a6efd36c62024-02-23T00:00:18ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352024-01-01174784479610.1109/JSTARS.2024.336274810423123FSOD4RSI: Few-Shot Object Detection for Remote Sensing Images via Features Aggregation and Scale AttentionHonghao Gao0https://orcid.org/0000-0001-6861-9684Shuping Wu1https://orcid.org/0009-0006-0454-2706Ye Wang2https://orcid.org/0000-0003-1454-2161Jung Yoon Kim3https://orcid.org/0000-0002-2396-9514Yueshen Xu4https://orcid.org/0000-0002-2396-9514School of Computer Engineering and Science, Shanghai University, Shanghai, ChinaSchool of Computer Engineering and Science, Shanghai University, Shanghai, ChinaSchool of Computer Engineering and Science, Shanghai University, Shanghai, ChinaCollege of Future Industry, Gachon University, Seongnam, 13120, South KoreaSchool of Computer Science and Technology, Xidian University, Xi'an, ChinaDue to the continuous development of few-shot learning, there have been notable advancements in methods for few-shot object detection in recent years. However, most existing methods in this domain primarily focus on natural images, neglecting the challenges posed by variations in object scales, which are usually encountered in remote sensing images. This article proposes a new few-shot object detection model designed to handle the issue of object scale variation in remote sensing images. Our developed model has two essential parts: a feature aggregation module (FAM) and a scale-aware attention module (SAM). Considering the few-shot features of remote sensing images, we designed the FAM to improve the support and query features through channel multiplication operations utilizing a feature pyramid network and a transformer encoder. The created FAM better extracts the global features of remote sensing images and enhances the significant feature representation of few-shot remote sensing objects. In addition, we design the SAM to address the scale variation problems that frequently occur in remote sensing images. By employing multiscale convolutions, the SAM enables the acquisition of contextual features while adapting to objects of varying scales. Extensive experiments were conducted on benchmark datasets, including NWPU VHR-10 and DIOR datasets, and the results show that our model indeed addresses the challenges posed by object scale variation and improves the applicability of few-shot object detection in the remote sensing domain.https://ieeexplore.ieee.org/document/10423123/Attention mechanismfeature aggregationfew-shot learningobject detectionremote sensing images
spellingShingle Honghao Gao
Shuping Wu
Ye Wang
Jung Yoon Kim
Yueshen Xu
FSOD4RSI: Few-Shot Object Detection for Remote Sensing Images via Features Aggregation and Scale Attention
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Attention mechanism
feature aggregation
few-shot learning
object detection
remote sensing images
title FSOD4RSI: Few-Shot Object Detection for Remote Sensing Images via Features Aggregation and Scale Attention
title_full FSOD4RSI: Few-Shot Object Detection for Remote Sensing Images via Features Aggregation and Scale Attention
title_fullStr FSOD4RSI: Few-Shot Object Detection for Remote Sensing Images via Features Aggregation and Scale Attention
title_full_unstemmed FSOD4RSI: Few-Shot Object Detection for Remote Sensing Images via Features Aggregation and Scale Attention
title_short FSOD4RSI: Few-Shot Object Detection for Remote Sensing Images via Features Aggregation and Scale Attention
title_sort fsod4rsi few shot object detection for remote sensing images via features aggregation and scale attention
topic Attention mechanism
feature aggregation
few-shot learning
object detection
remote sensing images
url https://ieeexplore.ieee.org/document/10423123/
work_keys_str_mv AT honghaogao fsod4rsifewshotobjectdetectionforremotesensingimagesviafeaturesaggregationandscaleattention
AT shupingwu fsod4rsifewshotobjectdetectionforremotesensingimagesviafeaturesaggregationandscaleattention
AT yewang fsod4rsifewshotobjectdetectionforremotesensingimagesviafeaturesaggregationandscaleattention
AT jungyoonkim fsod4rsifewshotobjectdetectionforremotesensingimagesviafeaturesaggregationandscaleattention
AT yueshenxu fsod4rsifewshotobjectdetectionforremotesensingimagesviafeaturesaggregationandscaleattention