Binary residual feature pyramid network: An improved feature fusion module based on double‐channel residual pyramid structure for autonomous detection algorithm

Abstract The vehicle detection algorithm based on visual perception has been applied in all types of automatic driving scenes. However, there are still flaws in the current detection algorithm model, especially for small objects. The detection effect of vehicle objects with small pixels in the image...

Full description

Bibliographic Details
Main Authors: Tong Luo, Hai Wang, Yingfeng Cai, Long Chen, Kuan Wang, Yijie Yu
Format: Article
Language:English
Published: Wiley 2023-07-01
Series:IET Intelligent Transport Systems
Online Access:https://doi.org/10.1049/itr2.12291
_version_ 1797777636130291712
author Tong Luo
Hai Wang
Yingfeng Cai
Long Chen
Kuan Wang
Yijie Yu
author_facet Tong Luo
Hai Wang
Yingfeng Cai
Long Chen
Kuan Wang
Yijie Yu
author_sort Tong Luo
collection DOAJ
description Abstract The vehicle detection algorithm based on visual perception has been applied in all types of automatic driving scenes. However, there are still flaws in the current detection algorithm model, especially for small objects. The detection effect of vehicle objects with small pixels in the image is often missed and wrongly detected. This research proposes an improved feature fusion module based on double‐channel residual pyramid (DRP) structure for autonomous detection algorithm which named binary residual feature pyramid network (BiResFPN) to solve the above problems. Firstly, a DRP structure, which can effectively supplement the shallow information of the network, is proposed. The residual structure is added to the output feature map for further supplement. Then, an average sampling method of positive and negative samples based on intersection‐over‐union (IOU) value is proposed on the basis of this structure, aimed at the unbalanced sampling of positive and negative samples in the training stage of faster regions with CNN features (RCNN). It leads to the reduction of the interference of a large number of simple negative samples, which makes the learned model better. The experimental results based on the KITTI and BDD100K dataset datasets show that the capability of the feature fusion module based on DRP structure is strong for small object detection. Compared with Faster‐RCNN (FPN), the detection algorithm of small object detection accuracy APsmall was increased by 2.6%, APmedium and APlarge was increased by 1.1% and 0.3%.
first_indexed 2024-03-12T23:06:39Z
format Article
id doaj.art-617a34ff0ce547a2a7ced51c6a3db484
institution Directory Open Access Journal
issn 1751-956X
1751-9578
language English
last_indexed 2024-03-12T23:06:39Z
publishDate 2023-07-01
publisher Wiley
record_format Article
series IET Intelligent Transport Systems
spelling doaj.art-617a34ff0ce547a2a7ced51c6a3db4842023-07-18T15:38:52ZengWileyIET Intelligent Transport Systems1751-956X1751-95782023-07-011771288130110.1049/itr2.12291Binary residual feature pyramid network: An improved feature fusion module based on double‐channel residual pyramid structure for autonomous detection algorithmTong Luo0Hai Wang1Yingfeng Cai2Long Chen3Kuan Wang4Yijie Yu5Automotive Engineering Research Institute of Jiangsu University Zhenjiang People's Republic of ChinaSchool of Automotive and Traffic Engineering of Jiangsu University Zhenjiang People's Republic of ChinaAutomotive Engineering Research Institute of Jiangsu University Zhenjiang People's Republic of ChinaAutomotive Engineering Research Institute of Jiangsu University Zhenjiang People's Republic of ChinaSchool of Automotive and Traffic Engineering of Jiangsu University Zhenjiang People's Republic of ChinaSchool of Automotive and Traffic Engineering of Jiangsu University Zhenjiang People's Republic of ChinaAbstract The vehicle detection algorithm based on visual perception has been applied in all types of automatic driving scenes. However, there are still flaws in the current detection algorithm model, especially for small objects. The detection effect of vehicle objects with small pixels in the image is often missed and wrongly detected. This research proposes an improved feature fusion module based on double‐channel residual pyramid (DRP) structure for autonomous detection algorithm which named binary residual feature pyramid network (BiResFPN) to solve the above problems. Firstly, a DRP structure, which can effectively supplement the shallow information of the network, is proposed. The residual structure is added to the output feature map for further supplement. Then, an average sampling method of positive and negative samples based on intersection‐over‐union (IOU) value is proposed on the basis of this structure, aimed at the unbalanced sampling of positive and negative samples in the training stage of faster regions with CNN features (RCNN). It leads to the reduction of the interference of a large number of simple negative samples, which makes the learned model better. The experimental results based on the KITTI and BDD100K dataset datasets show that the capability of the feature fusion module based on DRP structure is strong for small object detection. Compared with Faster‐RCNN (FPN), the detection algorithm of small object detection accuracy APsmall was increased by 2.6%, APmedium and APlarge was increased by 1.1% and 0.3%.https://doi.org/10.1049/itr2.12291
spellingShingle Tong Luo
Hai Wang
Yingfeng Cai
Long Chen
Kuan Wang
Yijie Yu
Binary residual feature pyramid network: An improved feature fusion module based on double‐channel residual pyramid structure for autonomous detection algorithm
IET Intelligent Transport Systems
title Binary residual feature pyramid network: An improved feature fusion module based on double‐channel residual pyramid structure for autonomous detection algorithm
title_full Binary residual feature pyramid network: An improved feature fusion module based on double‐channel residual pyramid structure for autonomous detection algorithm
title_fullStr Binary residual feature pyramid network: An improved feature fusion module based on double‐channel residual pyramid structure for autonomous detection algorithm
title_full_unstemmed Binary residual feature pyramid network: An improved feature fusion module based on double‐channel residual pyramid structure for autonomous detection algorithm
title_short Binary residual feature pyramid network: An improved feature fusion module based on double‐channel residual pyramid structure for autonomous detection algorithm
title_sort binary residual feature pyramid network an improved feature fusion module based on double channel residual pyramid structure for autonomous detection algorithm
url https://doi.org/10.1049/itr2.12291
work_keys_str_mv AT tongluo binaryresidualfeaturepyramidnetworkanimprovedfeaturefusionmodulebasedondoublechannelresidualpyramidstructureforautonomousdetectionalgorithm
AT haiwang binaryresidualfeaturepyramidnetworkanimprovedfeaturefusionmodulebasedondoublechannelresidualpyramidstructureforautonomousdetectionalgorithm
AT yingfengcai binaryresidualfeaturepyramidnetworkanimprovedfeaturefusionmodulebasedondoublechannelresidualpyramidstructureforautonomousdetectionalgorithm
AT longchen binaryresidualfeaturepyramidnetworkanimprovedfeaturefusionmodulebasedondoublechannelresidualpyramidstructureforautonomousdetectionalgorithm
AT kuanwang binaryresidualfeaturepyramidnetworkanimprovedfeaturefusionmodulebasedondoublechannelresidualpyramidstructureforautonomousdetectionalgorithm
AT yijieyu binaryresidualfeaturepyramidnetworkanimprovedfeaturefusionmodulebasedondoublechannelresidualpyramidstructureforautonomousdetectionalgorithm