Multiscale anchor box and optimized classification with faster R‐CNN for object detection
Abstract For the two‐stage object detector as a faster region‐convolutional neural network (Faster R‐CNN), upgrading the accuracy of object recognition depends on the proposal box, which is generated by the region proposal algorithms. Due to the limitations of the anchor setting of Faster RCNN, the...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2023-04-01
|
Series: | IET Image Processing |
Subjects: | |
Online Access: | https://doi.org/10.1049/ipr2.12714 |
_version_ | 1797852204806176768 |
---|---|
author | Sheng‐Ye Wang Zhong Qu |
author_facet | Sheng‐Ye Wang Zhong Qu |
author_sort | Sheng‐Ye Wang |
collection | DOAJ |
description | Abstract For the two‐stage object detector as a faster region‐convolutional neural network (Faster R‐CNN), upgrading the accuracy of object recognition depends on the proposal box, which is generated by the region proposal algorithms. Due to the limitations of the anchor setting of Faster RCNN, the size of the proposal box generated by the region proposal network (RPN) used is large, which would easily cause a great number of overflows in the sliding search. To improve the accuracy of object detection and remit the overflow problem of the anchor box, multi‐scale anchor box and moving overflow anchor box strategies are introduced here. Then, to increase the positive sample range of the foreground, the hierarchical weight cross entropy classification function is set for binary classification in the RPN network. These strategies could improve the accuracy of object detection. The experimental result achieves 76.2% AP on the Pascal VOC 2007(VOC 07) dataset, which is 2.7% higher than the Faster R‐CNN. The result of the Pascal VOC 2012(VOC 12) test, we achieve 75.6% AP, is improved by 2.5% compared with the Faster R‐CNN. |
first_indexed | 2024-04-09T19:30:05Z |
format | Article |
id | doaj.art-8536b6b5ccac4024933afdc62368ff65 |
institution | Directory Open Access Journal |
issn | 1751-9659 1751-9667 |
language | English |
last_indexed | 2024-04-09T19:30:05Z |
publishDate | 2023-04-01 |
publisher | Wiley |
record_format | Article |
series | IET Image Processing |
spelling | doaj.art-8536b6b5ccac4024933afdc62368ff652023-04-05T04:26:01ZengWileyIET Image Processing1751-96591751-96672023-04-011751322133310.1049/ipr2.12714Multiscale anchor box and optimized classification with faster R‐CNN for object detectionSheng‐Ye Wang0Zhong Qu1College of Computer Science and Technology Chongqing University of Posts and Telecommunications Nan'an District Chongqing People's Republic of ChinaCollege of Computer Science and Technology Chongqing University of Posts and Telecommunications Nan'an District Chongqing People's Republic of ChinaAbstract For the two‐stage object detector as a faster region‐convolutional neural network (Faster R‐CNN), upgrading the accuracy of object recognition depends on the proposal box, which is generated by the region proposal algorithms. Due to the limitations of the anchor setting of Faster RCNN, the size of the proposal box generated by the region proposal network (RPN) used is large, which would easily cause a great number of overflows in the sliding search. To improve the accuracy of object detection and remit the overflow problem of the anchor box, multi‐scale anchor box and moving overflow anchor box strategies are introduced here. Then, to increase the positive sample range of the foreground, the hierarchical weight cross entropy classification function is set for binary classification in the RPN network. These strategies could improve the accuracy of object detection. The experimental result achieves 76.2% AP on the Pascal VOC 2007(VOC 07) dataset, which is 2.7% higher than the Faster R‐CNN. The result of the Pascal VOC 2012(VOC 12) test, we achieve 75.6% AP, is improved by 2.5% compared with the Faster R‐CNN.https://doi.org/10.1049/ipr2.12714image processingimage recognitionobject detection |
spellingShingle | Sheng‐Ye Wang Zhong Qu Multiscale anchor box and optimized classification with faster R‐CNN for object detection IET Image Processing image processing image recognition object detection |
title | Multiscale anchor box and optimized classification with faster R‐CNN for object detection |
title_full | Multiscale anchor box and optimized classification with faster R‐CNN for object detection |
title_fullStr | Multiscale anchor box and optimized classification with faster R‐CNN for object detection |
title_full_unstemmed | Multiscale anchor box and optimized classification with faster R‐CNN for object detection |
title_short | Multiscale anchor box and optimized classification with faster R‐CNN for object detection |
title_sort | multiscale anchor box and optimized classification with faster r cnn for object detection |
topic | image processing image recognition object detection |
url | https://doi.org/10.1049/ipr2.12714 |
work_keys_str_mv | AT shengyewang multiscaleanchorboxandoptimizedclassificationwithfasterrcnnforobjectdetection AT zhongqu multiscaleanchorboxandoptimizedclassificationwithfasterrcnnforobjectdetection |