Multiscale anchor box and optimized classification with faster R‐CNN for object detection

Abstract For the two‐stage object detector as a faster region‐convolutional neural network (Faster R‐CNN), upgrading the accuracy of object recognition depends on the proposal box, which is generated by the region proposal algorithms. Due to the limitations of the anchor setting of Faster RCNN, the...

Full description

Bibliographic Details
Main Authors: Sheng‐Ye Wang, Zhong Qu
Format: Article
Language:English
Published: Wiley 2023-04-01
Series:IET Image Processing
Subjects:
Online Access:https://doi.org/10.1049/ipr2.12714
_version_ 1797852204806176768
author Sheng‐Ye Wang
Zhong Qu
author_facet Sheng‐Ye Wang
Zhong Qu
author_sort Sheng‐Ye Wang
collection DOAJ
description Abstract For the two‐stage object detector as a faster region‐convolutional neural network (Faster R‐CNN), upgrading the accuracy of object recognition depends on the proposal box, which is generated by the region proposal algorithms. Due to the limitations of the anchor setting of Faster RCNN, the size of the proposal box generated by the region proposal network (RPN) used is large, which would easily cause a great number of overflows in the sliding search. To improve the accuracy of object detection and remit the overflow problem of the anchor box, multi‐scale anchor box and moving overflow anchor box strategies are introduced here. Then, to increase the positive sample range of the foreground, the hierarchical weight cross entropy classification function is set for binary classification in the RPN network. These strategies could improve the accuracy of object detection. The experimental result achieves 76.2% AP on the Pascal VOC 2007(VOC 07) dataset, which is 2.7% higher than the Faster R‐CNN. The result of the Pascal VOC 2012(VOC 12) test, we achieve 75.6% AP, is improved by 2.5% compared with the Faster R‐CNN.
first_indexed 2024-04-09T19:30:05Z
format Article
id doaj.art-8536b6b5ccac4024933afdc62368ff65
institution Directory Open Access Journal
issn 1751-9659
1751-9667
language English
last_indexed 2024-04-09T19:30:05Z
publishDate 2023-04-01
publisher Wiley
record_format Article
series IET Image Processing
spelling doaj.art-8536b6b5ccac4024933afdc62368ff652023-04-05T04:26:01ZengWileyIET Image Processing1751-96591751-96672023-04-011751322133310.1049/ipr2.12714Multiscale anchor box and optimized classification with faster R‐CNN for object detectionSheng‐Ye Wang0Zhong Qu1College of Computer Science and Technology Chongqing University of Posts and Telecommunications Nan'an District Chongqing People's Republic of ChinaCollege of Computer Science and Technology Chongqing University of Posts and Telecommunications Nan'an District Chongqing People's Republic of ChinaAbstract For the two‐stage object detector as a faster region‐convolutional neural network (Faster R‐CNN), upgrading the accuracy of object recognition depends on the proposal box, which is generated by the region proposal algorithms. Due to the limitations of the anchor setting of Faster RCNN, the size of the proposal box generated by the region proposal network (RPN) used is large, which would easily cause a great number of overflows in the sliding search. To improve the accuracy of object detection and remit the overflow problem of the anchor box, multi‐scale anchor box and moving overflow anchor box strategies are introduced here. Then, to increase the positive sample range of the foreground, the hierarchical weight cross entropy classification function is set for binary classification in the RPN network. These strategies could improve the accuracy of object detection. The experimental result achieves 76.2% AP on the Pascal VOC 2007(VOC 07) dataset, which is 2.7% higher than the Faster R‐CNN. The result of the Pascal VOC 2012(VOC 12) test, we achieve 75.6% AP, is improved by 2.5% compared with the Faster R‐CNN.https://doi.org/10.1049/ipr2.12714image processingimage recognitionobject detection
spellingShingle Sheng‐Ye Wang
Zhong Qu
Multiscale anchor box and optimized classification with faster R‐CNN for object detection
IET Image Processing
image processing
image recognition
object detection
title Multiscale anchor box and optimized classification with faster R‐CNN for object detection
title_full Multiscale anchor box and optimized classification with faster R‐CNN for object detection
title_fullStr Multiscale anchor box and optimized classification with faster R‐CNN for object detection
title_full_unstemmed Multiscale anchor box and optimized classification with faster R‐CNN for object detection
title_short Multiscale anchor box and optimized classification with faster R‐CNN for object detection
title_sort multiscale anchor box and optimized classification with faster r cnn for object detection
topic image processing
image recognition
object detection
url https://doi.org/10.1049/ipr2.12714
work_keys_str_mv AT shengyewang multiscaleanchorboxandoptimizedclassificationwithfasterrcnnforobjectdetection
AT zhongqu multiscaleanchorboxandoptimizedclassificationwithfasterrcnnforobjectdetection