Improved Feature Extraction and Similarity Algorithm for Video Object Detection

Video object detection is an important research direction of computer vision. The task of video object detection is to detect and classify moving objects in a sequence of images. Based on the static image object detector, most of the existing video object detection methods use the unique temporal co...

Full description

Bibliographic Details
Main Authors: Haotian You, Yufang Lu, Haihua Tang
Format: Article
Language:English
Published: MDPI AG 2023-02-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/14/2/115
_version_ 1797620367881142272
author Haotian You
Yufang Lu
Haihua Tang
author_facet Haotian You
Yufang Lu
Haihua Tang
author_sort Haotian You
collection DOAJ
description Video object detection is an important research direction of computer vision. The task of video object detection is to detect and classify moving objects in a sequence of images. Based on the static image object detector, most of the existing video object detection methods use the unique temporal correlation of video to solve the problem of missed detection and false detection caused by moving object occlusion and blur. Another video object detection model guided by an optical flow network is widely used. Feature aggregation of adjacent frames is performed by estimating the optical flow field. However, there are many redundant computations for feature aggregation of adjacent frames. To begin with, this paper improved Faster RCNN by Feature Pyramid and Dynamic Region Aware Convolution. Then the S-SELSA module is proposed from the perspective of semantic and feature similarity. Feature similarity is obtained by a modified SSIM algorithm. The module can aggregate the features of frames globally to avoid redundancy. Finally, the experimental results on the ImageNet VID and DET datasets show that the mAP of the method proposed in this paper is 83.55%, which is higher than the existing methods.
first_indexed 2024-03-11T08:40:16Z
format Article
id doaj.art-f2254d4e7a5641149313343c48851cec
institution Directory Open Access Journal
issn 2078-2489
language English
last_indexed 2024-03-11T08:40:16Z
publishDate 2023-02-01
publisher MDPI AG
record_format Article
series Information
spelling doaj.art-f2254d4e7a5641149313343c48851cec2023-11-16T21:12:31ZengMDPI AGInformation2078-24892023-02-0114211510.3390/info14020115Improved Feature Extraction and Similarity Algorithm for Video Object DetectionHaotian You0Yufang Lu1Haihua Tang2School of Information Science and Engineering, Guilin University of Technology, Guilin 541004, ChinaSchool of Information Science and Engineering, Guilin University of Technology, Guilin 541004, ChinaSchool of Information Science and Engineering, Guilin University of Technology, Guilin 541004, ChinaVideo object detection is an important research direction of computer vision. The task of video object detection is to detect and classify moving objects in a sequence of images. Based on the static image object detector, most of the existing video object detection methods use the unique temporal correlation of video to solve the problem of missed detection and false detection caused by moving object occlusion and blur. Another video object detection model guided by an optical flow network is widely used. Feature aggregation of adjacent frames is performed by estimating the optical flow field. However, there are many redundant computations for feature aggregation of adjacent frames. To begin with, this paper improved Faster RCNN by Feature Pyramid and Dynamic Region Aware Convolution. Then the S-SELSA module is proposed from the perspective of semantic and feature similarity. Feature similarity is obtained by a modified SSIM algorithm. The module can aggregate the features of frames globally to avoid redundancy. Finally, the experimental results on the ImageNet VID and DET datasets show that the mAP of the method proposed in this paper is 83.55%, which is higher than the existing methods.https://www.mdpi.com/2078-2489/14/2/115video object detectionfaster RCNNfeature pyramidsimilarity algorithmsdynamic region aware convolution
spellingShingle Haotian You
Yufang Lu
Haihua Tang
Improved Feature Extraction and Similarity Algorithm for Video Object Detection
Information
video object detection
faster RCNN
feature pyramid
similarity algorithms
dynamic region aware convolution
title Improved Feature Extraction and Similarity Algorithm for Video Object Detection
title_full Improved Feature Extraction and Similarity Algorithm for Video Object Detection
title_fullStr Improved Feature Extraction and Similarity Algorithm for Video Object Detection
title_full_unstemmed Improved Feature Extraction and Similarity Algorithm for Video Object Detection
title_short Improved Feature Extraction and Similarity Algorithm for Video Object Detection
title_sort improved feature extraction and similarity algorithm for video object detection
topic video object detection
faster RCNN
feature pyramid
similarity algorithms
dynamic region aware convolution
url https://www.mdpi.com/2078-2489/14/2/115
work_keys_str_mv AT haotianyou improvedfeatureextractionandsimilarityalgorithmforvideoobjectdetection
AT yufanglu improvedfeatureextractionandsimilarityalgorithmforvideoobjectdetection
AT haihuatang improvedfeatureextractionandsimilarityalgorithmforvideoobjectdetection