SFGNet detecting objects via spatial fine-grained feature and enhanced RPN with spatial context

Object detection, which is one of the most fundamental visual recognition tasks, has been a hotspot in computer vision. CNN (Convolutional Neural Networks) have been widely employed for building detector. Due to the success of RPN (Region Proposal Network), the two-stage detectors get both classific...

Full description

Bibliographic Details
Main Authors: Jun Hu, Yongfeng Wang, Shuai Cheng, Jiaxin Liu, Jiawen Kang, Wenxing Yang
Format: Article
Language:English
Published: Taylor & Francis Group 2022-12-01
Series:Systems Science & Control Engineering
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/21642583.2022.2062479
_version_ 1818188438259630080
author Jun Hu
Yongfeng Wang
Shuai Cheng
Jiaxin Liu
Jiawen Kang
Wenxing Yang
author_facet Jun Hu
Yongfeng Wang
Shuai Cheng
Jiaxin Liu
Jiawen Kang
Wenxing Yang
author_sort Jun Hu
collection DOAJ
description Object detection, which is one of the most fundamental visual recognition tasks, has been a hotspot in computer vision. CNN (Convolutional Neural Networks) have been widely employed for building detector. Due to the success of RPN (Region Proposal Network), the two-stage detectors get both classification accuracy and precise regression bounding boxes. However, they still struggle in small-size object detection. In this paper, we present a deep network, namely Spatial Fine-Grained Network (SFGN). The SFGN that exploits Spatial Fine-Grained Features (SFGF) concatenates the higher resolution features, which is fine-grained with the low resolution features and high-level semantic by stacking spatial features for fine-grained features. An enhanced region proposal generator is proposed to get the objectless for small object to obtain a small set of proposal. The contextual information surrounding the region of interest is embedded using local spatial information for increasing the useful information and discriminating the background. For improving the detection performance, we use a simple yet surprisingly effective online hard example mining (OHEM) algorithm for training region proposal generator. It embeds an efficiently implemented soft non-maximum suppression (soft-NMS) for replacing with tradition NMS to obtain consistent improvements without increasing the computational complexity in inference. On PASCAL VOC 2007 and PASCAL VOC 2012 datasets, our SFGN improves baseline model from 81.2% mAP to 80.6% mAP. On MS COCO dataset, SFGN also performs better than baseline model. As intuition suggests, our detection results provide strong evidence that our SFGN improves detection accuracy, especially in small object test.
first_indexed 2024-12-11T23:26:55Z
format Article
id doaj.art-6ffc9fc8834e43139739b57865b0487b
institution Directory Open Access Journal
issn 2164-2583
language English
last_indexed 2024-12-11T23:26:55Z
publishDate 2022-12-01
publisher Taylor & Francis Group
record_format Article
series Systems Science & Control Engineering
spelling doaj.art-6ffc9fc8834e43139739b57865b0487b2022-12-22T00:46:09ZengTaylor & Francis GroupSystems Science & Control Engineering2164-25832022-12-0110138840610.1080/21642583.2022.2062479SFGNet detecting objects via spatial fine-grained feature and enhanced RPN with spatial contextJun Hu0Yongfeng Wang1Shuai Cheng2Jiaxin Liu3Jiawen Kang4Wenxing Yang5School of Computer Science and Engineering, Northeastern University, Liaoning, People's Republic of ChinaNeusoft Reachauto Corporation, Liaoning, People's Republic of ChinaNeusoft Reachauto Corporation, Liaoning, People's Republic of ChinaSchool of Computer Science and Engineering, Northeastern University, Liaoning, People's Republic of ChinaSchool of Computer Science and Engineering, Northeastern University, Liaoning, People's Republic of ChinaSchool of Computer Science and Engineering, Northeastern University, Liaoning, People's Republic of ChinaObject detection, which is one of the most fundamental visual recognition tasks, has been a hotspot in computer vision. CNN (Convolutional Neural Networks) have been widely employed for building detector. Due to the success of RPN (Region Proposal Network), the two-stage detectors get both classification accuracy and precise regression bounding boxes. However, they still struggle in small-size object detection. In this paper, we present a deep network, namely Spatial Fine-Grained Network (SFGN). The SFGN that exploits Spatial Fine-Grained Features (SFGF) concatenates the higher resolution features, which is fine-grained with the low resolution features and high-level semantic by stacking spatial features for fine-grained features. An enhanced region proposal generator is proposed to get the objectless for small object to obtain a small set of proposal. The contextual information surrounding the region of interest is embedded using local spatial information for increasing the useful information and discriminating the background. For improving the detection performance, we use a simple yet surprisingly effective online hard example mining (OHEM) algorithm for training region proposal generator. It embeds an efficiently implemented soft non-maximum suppression (soft-NMS) for replacing with tradition NMS to obtain consistent improvements without increasing the computational complexity in inference. On PASCAL VOC 2007 and PASCAL VOC 2012 datasets, our SFGN improves baseline model from 81.2% mAP to 80.6% mAP. On MS COCO dataset, SFGN also performs better than baseline model. As intuition suggests, our detection results provide strong evidence that our SFGN improves detection accuracy, especially in small object test.https://www.tandfonline.com/doi/10.1080/21642583.2022.2062479Convolutional neural networkssmall-size object detectionspatial fine-grained featuresonline hard example mining
spellingShingle Jun Hu
Yongfeng Wang
Shuai Cheng
Jiaxin Liu
Jiawen Kang
Wenxing Yang
SFGNet detecting objects via spatial fine-grained feature and enhanced RPN with spatial context
Systems Science & Control Engineering
Convolutional neural networks
small-size object detection
spatial fine-grained features
online hard example mining
title SFGNet detecting objects via spatial fine-grained feature and enhanced RPN with spatial context
title_full SFGNet detecting objects via spatial fine-grained feature and enhanced RPN with spatial context
title_fullStr SFGNet detecting objects via spatial fine-grained feature and enhanced RPN with spatial context
title_full_unstemmed SFGNet detecting objects via spatial fine-grained feature and enhanced RPN with spatial context
title_short SFGNet detecting objects via spatial fine-grained feature and enhanced RPN with spatial context
title_sort sfgnet detecting objects via spatial fine grained feature and enhanced rpn with spatial context
topic Convolutional neural networks
small-size object detection
spatial fine-grained features
online hard example mining
url https://www.tandfonline.com/doi/10.1080/21642583.2022.2062479
work_keys_str_mv AT junhu sfgnetdetectingobjectsviaspatialfinegrainedfeatureandenhancedrpnwithspatialcontext
AT yongfengwang sfgnetdetectingobjectsviaspatialfinegrainedfeatureandenhancedrpnwithspatialcontext
AT shuaicheng sfgnetdetectingobjectsviaspatialfinegrainedfeatureandenhancedrpnwithspatialcontext
AT jiaxinliu sfgnetdetectingobjectsviaspatialfinegrainedfeatureandenhancedrpnwithspatialcontext
AT jiawenkang sfgnetdetectingobjectsviaspatialfinegrainedfeatureandenhancedrpnwithspatialcontext
AT wenxingyang sfgnetdetectingobjectsviaspatialfinegrainedfeatureandenhancedrpnwithspatialcontext