An object detection approach with residual feature fusion and second‐order term attention mechanism

Abstract Automatically detecting and locating remote occlusion small objects from the images of complex traffic environments is a valuable and challenging research. Since the boundary box location is not sufficiently accurate and it is difficult to distinguish overlapping and occluded objects, the a...

Full description

Bibliographic Details
Main Authors: Cuijin Li, Zhong Qu, Shengye Wang
Format: Article
Language:English
Published: Wiley 2024-04-01
Series:CAAI Transactions on Intelligence Technology
Subjects:
Online Access:https://doi.org/10.1049/cit2.12236
_version_ 1797201299468451840
author Cuijin Li
Zhong Qu
Shengye Wang
author_facet Cuijin Li
Zhong Qu
Shengye Wang
author_sort Cuijin Li
collection DOAJ
description Abstract Automatically detecting and locating remote occlusion small objects from the images of complex traffic environments is a valuable and challenging research. Since the boundary box location is not sufficiently accurate and it is difficult to distinguish overlapping and occluded objects, the authors propose a network model with a second‐order term attention mechanism and occlusion loss. First, the backbone network is built on CSPDarkNet53. Then a method is designed for the feature extraction network based on an item‐wise attention mechanism, which uses the filtered weighted feature vector to replace the original residual fusion and adds a second‐order term to reduce the information loss in the process of fusion and accelerate the convergence of the model. Finally, an objected occlusion regression loss function is studied to reduce the problems of missed detections caused by dense objects. Sufficient experimental results demonstrate that the authors’ method achieved state‐of‐the‐art performance without reducing the detection speed. The mAP@.5 of the method is 85.8% on the Foggy_cityscapes dataset and the mAP@.5 of the method is 97.8% on the KITTI dataset.
first_indexed 2024-04-24T07:45:20Z
format Article
id doaj.art-1033f408a0894c8090a7115f37821d1e
institution Directory Open Access Journal
issn 2468-2322
language English
last_indexed 2024-04-24T07:45:20Z
publishDate 2024-04-01
publisher Wiley
record_format Article
series CAAI Transactions on Intelligence Technology
spelling doaj.art-1033f408a0894c8090a7115f37821d1e2024-04-19T03:11:28ZengWileyCAAI Transactions on Intelligence Technology2468-23222024-04-019241142410.1049/cit2.12236An object detection approach with residual feature fusion and second‐order term attention mechanismCuijin Li0Zhong Qu1Shengye Wang2College of Computer Science and Technology Chongqing University of Posts and Telecommunications Chongqing ChinaCollege of Computer Science and Technology Chongqing University of Posts and Telecommunications Chongqing ChinaCollege of Computer Science and Technology Chongqing University of Posts and Telecommunications Chongqing ChinaAbstract Automatically detecting and locating remote occlusion small objects from the images of complex traffic environments is a valuable and challenging research. Since the boundary box location is not sufficiently accurate and it is difficult to distinguish overlapping and occluded objects, the authors propose a network model with a second‐order term attention mechanism and occlusion loss. First, the backbone network is built on CSPDarkNet53. Then a method is designed for the feature extraction network based on an item‐wise attention mechanism, which uses the filtered weighted feature vector to replace the original residual fusion and adds a second‐order term to reduce the information loss in the process of fusion and accelerate the convergence of the model. Finally, an objected occlusion regression loss function is studied to reduce the problems of missed detections caused by dense objects. Sufficient experimental results demonstrate that the authors’ method achieved state‐of‐the‐art performance without reducing the detection speed. The mAP@.5 of the method is 85.8% on the Foggy_cityscapes dataset and the mAP@.5 of the method is 97.8% on the KITTI dataset.https://doi.org/10.1049/cit2.12236artificial intelligencecomputer visionimage processingmachine learningneural networkobject recognition
spellingShingle Cuijin Li
Zhong Qu
Shengye Wang
An object detection approach with residual feature fusion and second‐order term attention mechanism
CAAI Transactions on Intelligence Technology
artificial intelligence
computer vision
image processing
machine learning
neural network
object recognition
title An object detection approach with residual feature fusion and second‐order term attention mechanism
title_full An object detection approach with residual feature fusion and second‐order term attention mechanism
title_fullStr An object detection approach with residual feature fusion and second‐order term attention mechanism
title_full_unstemmed An object detection approach with residual feature fusion and second‐order term attention mechanism
title_short An object detection approach with residual feature fusion and second‐order term attention mechanism
title_sort object detection approach with residual feature fusion and second order term attention mechanism
topic artificial intelligence
computer vision
image processing
machine learning
neural network
object recognition
url https://doi.org/10.1049/cit2.12236
work_keys_str_mv AT cuijinli anobjectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism
AT zhongqu anobjectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism
AT shengyewang anobjectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism
AT cuijinli objectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism
AT zhongqu objectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism
AT shengyewang objectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism