BFE-Net: Bidirectional Multi-Scale Feature Enhancement for Small Object Detection

Small object detection becomes a challenging problem in computer vision due to low resolution and less feature information. Making full use of high-resolution features is an important factor in improving small object detection. In this paper, to improve the utilization of high-resolution features, t...

Full description

Bibliographic Details
Main Authors: Qian Zhang, Jie Ren, Hong Liang, Ying Yang, Lu Chen
Format: Article
Language:English
Published: MDPI AG 2022-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/12/7/3587
_version_ 1797440315081097216
author Qian Zhang
Jie Ren
Hong Liang
Ying Yang
Lu Chen
author_facet Qian Zhang
Jie Ren
Hong Liang
Ying Yang
Lu Chen
author_sort Qian Zhang
collection DOAJ
description Small object detection becomes a challenging problem in computer vision due to low resolution and less feature information. Making full use of high-resolution features is an important factor in improving small object detection. In this paper, to improve the utilization of high-resolution features, this work proposes the Bidirectional Multi-scale Feature Enhancement Network (BFE-Net) based on RetinaNet. First, this work introduces a bidirectional feature pyramid structure to shorten the propagation path of high-resolution features. Then, this work utilizes residually connected dilated convolutional blocks to fully extract high-resolution features of low-feature layers. Finally, this work supplements the high-resolution features lost in the high-level feature propagation process by leveraging the high-level guided lower-level features. Experiments show that our proposed BFE-Net achieves stable performance gains in the object detection task. Specifically, the improved method improves RetinaNet from 34.4 AP to 36.3 AP on the challenging MS COCO dataset and especially achieves excellent results in small object detection with an improvement of 2.8%.
first_indexed 2024-03-09T12:05:25Z
format Article
id doaj.art-252be8739075450bbea847ebe964f4e8
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-09T12:05:25Z
publishDate 2022-04-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-252be8739075450bbea847ebe964f4e82023-11-30T22:57:48ZengMDPI AGApplied Sciences2076-34172022-04-01127358710.3390/app12073587BFE-Net: Bidirectional Multi-Scale Feature Enhancement for Small Object DetectionQian Zhang0Jie Ren1Hong Liang2Ying Yang3Lu Chen4College of Computer Science and Technology, China University of Petroleum (East China), Qingdao 266580, ChinaCollege of Computer Science and Technology, China University of Petroleum (East China), Qingdao 266580, ChinaCollege of Computer Science and Technology, China University of Petroleum (East China), Qingdao 266580, ChinaCollege of Computer Science and Technology, China University of Petroleum (East China), Qingdao 266580, ChinaCollege of Computer Science and Technology, China University of Petroleum (East China), Qingdao 266580, ChinaSmall object detection becomes a challenging problem in computer vision due to low resolution and less feature information. Making full use of high-resolution features is an important factor in improving small object detection. In this paper, to improve the utilization of high-resolution features, this work proposes the Bidirectional Multi-scale Feature Enhancement Network (BFE-Net) based on RetinaNet. First, this work introduces a bidirectional feature pyramid structure to shorten the propagation path of high-resolution features. Then, this work utilizes residually connected dilated convolutional blocks to fully extract high-resolution features of low-feature layers. Finally, this work supplements the high-resolution features lost in the high-level feature propagation process by leveraging the high-level guided lower-level features. Experiments show that our proposed BFE-Net achieves stable performance gains in the object detection task. Specifically, the improved method improves RetinaNet from 34.4 AP to 36.3 AP on the challenging MS COCO dataset and especially achieves excellent results in small object detection with an improvement of 2.8%.https://www.mdpi.com/2076-3417/12/7/3587small object detectiondilated convolutionattention mechanismfeature pyramid
spellingShingle Qian Zhang
Jie Ren
Hong Liang
Ying Yang
Lu Chen
BFE-Net: Bidirectional Multi-Scale Feature Enhancement for Small Object Detection
Applied Sciences
small object detection
dilated convolution
attention mechanism
feature pyramid
title BFE-Net: Bidirectional Multi-Scale Feature Enhancement for Small Object Detection
title_full BFE-Net: Bidirectional Multi-Scale Feature Enhancement for Small Object Detection
title_fullStr BFE-Net: Bidirectional Multi-Scale Feature Enhancement for Small Object Detection
title_full_unstemmed BFE-Net: Bidirectional Multi-Scale Feature Enhancement for Small Object Detection
title_short BFE-Net: Bidirectional Multi-Scale Feature Enhancement for Small Object Detection
title_sort bfe net bidirectional multi scale feature enhancement for small object detection
topic small object detection
dilated convolution
attention mechanism
feature pyramid
url https://www.mdpi.com/2076-3417/12/7/3587
work_keys_str_mv AT qianzhang bfenetbidirectionalmultiscalefeatureenhancementforsmallobjectdetection
AT jieren bfenetbidirectionalmultiscalefeatureenhancementforsmallobjectdetection
AT hongliang bfenetbidirectionalmultiscalefeatureenhancementforsmallobjectdetection
AT yingyang bfenetbidirectionalmultiscalefeatureenhancementforsmallobjectdetection
AT luchen bfenetbidirectionalmultiscalefeatureenhancementforsmallobjectdetection