An object detection approach with residual feature fusion and second‐order term attention mechanism

Abstract Automatically detecting and locating remote occlusion small objects from the images of complex traffic environments is a valuable and challenging research. Since the boundary box location is not sufficiently accurate and it is difficult to distinguish overlapping and occluded objects, the a...

Full description

Bibliographic Details
Main Authors:	Cuijin Li, Zhong Qu, Shengye Wang
Format:	Article
Language:	English
Published:	Wiley 2024-04-01
Series:	CAAI Transactions on Intelligence Technology
Subjects:	artificial intelligence computer vision image processing machine learning neural network object recognition
Online Access:	https://doi.org/10.1049/cit2.12236

_version_	1797201299468451840
author	Cuijin Li Zhong Qu Shengye Wang
author_facet	Cuijin Li Zhong Qu Shengye Wang
author_sort	Cuijin Li
collection	DOAJ
description	Abstract Automatically detecting and locating remote occlusion small objects from the images of complex traffic environments is a valuable and challenging research. Since the boundary box location is not sufficiently accurate and it is difficult to distinguish overlapping and occluded objects, the authors propose a network model with a second‐order term attention mechanism and occlusion loss. First, the backbone network is built on CSPDarkNet53. Then a method is designed for the feature extraction network based on an item‐wise attention mechanism, which uses the filtered weighted feature vector to replace the original residual fusion and adds a second‐order term to reduce the information loss in the process of fusion and accelerate the convergence of the model. Finally, an objected occlusion regression loss function is studied to reduce the problems of missed detections caused by dense objects. Sufficient experimental results demonstrate that the authors’ method achieved state‐of‐the‐art performance without reducing the detection speed. The mAP@.5 of the method is 85.8% on the Foggy_cityscapes dataset and the mAP@.5 of the method is 97.8% on the KITTI dataset.
first_indexed	2024-04-24T07:45:20Z
format	Article
id	doaj.art-1033f408a0894c8090a7115f37821d1e
institution	Directory Open Access Journal
issn	2468-2322
language	English
last_indexed	2024-04-24T07:45:20Z
publishDate	2024-04-01
publisher	Wiley
record_format	Article
series	CAAI Transactions on Intelligence Technology
spelling	doaj.art-1033f408a0894c8090a7115f37821d1e2024-04-19T03:11:28ZengWileyCAAI Transactions on Intelligence Technology2468-23222024-04-019241142410.1049/cit2.12236An object detection approach with residual feature fusion and second‐order term attention mechanismCuijin Li0Zhong Qu1Shengye Wang2College of Computer Science and Technology Chongqing University of Posts and Telecommunications Chongqing ChinaCollege of Computer Science and Technology Chongqing University of Posts and Telecommunications Chongqing ChinaCollege of Computer Science and Technology Chongqing University of Posts and Telecommunications Chongqing ChinaAbstract Automatically detecting and locating remote occlusion small objects from the images of complex traffic environments is a valuable and challenging research. Since the boundary box location is not sufficiently accurate and it is difficult to distinguish overlapping and occluded objects, the authors propose a network model with a second‐order term attention mechanism and occlusion loss. First, the backbone network is built on CSPDarkNet53. Then a method is designed for the feature extraction network based on an item‐wise attention mechanism, which uses the filtered weighted feature vector to replace the original residual fusion and adds a second‐order term to reduce the information loss in the process of fusion and accelerate the convergence of the model. Finally, an objected occlusion regression loss function is studied to reduce the problems of missed detections caused by dense objects. Sufficient experimental results demonstrate that the authors’ method achieved state‐of‐the‐art performance without reducing the detection speed. The mAP@.5 of the method is 85.8% on the Foggy_cityscapes dataset and the mAP@.5 of the method is 97.8% on the KITTI dataset.https://doi.org/10.1049/cit2.12236artificial intelligencecomputer visionimage processingmachine learningneural networkobject recognition
spellingShingle	Cuijin Li Zhong Qu Shengye Wang An object detection approach with residual feature fusion and second‐order term attention mechanism CAAI Transactions on Intelligence Technology artificial intelligence computer vision image processing machine learning neural network object recognition
title	An object detection approach with residual feature fusion and second‐order term attention mechanism
title_full	An object detection approach with residual feature fusion and second‐order term attention mechanism
title_fullStr	An object detection approach with residual feature fusion and second‐order term attention mechanism
title_full_unstemmed	An object detection approach with residual feature fusion and second‐order term attention mechanism
title_short	An object detection approach with residual feature fusion and second‐order term attention mechanism
title_sort	object detection approach with residual feature fusion and second order term attention mechanism
topic	artificial intelligence computer vision image processing machine learning neural network object recognition
url	https://doi.org/10.1049/cit2.12236
work_keys_str_mv	AT cuijinli anobjectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism AT zhongqu anobjectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism AT shengyewang anobjectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism AT cuijinli objectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism AT zhongqu objectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism AT shengyewang objectdetectionapproachwithresidualfeaturefusionandsecondordertermattentionmechanism

An object detection approach with residual feature fusion and second‐order term attention mechanism

Similar Items