A tri-layer plugin to improve occluded detection

Detecting occluded objects still remains a challenge for state-of-the-art object detectors. The objective of this work is to improve the detection for such objects, and thereby improve the overall performance of a modern object detector. To this end we make the following four contributions: (1) We p...

Full description

Bibliographic Details
Main Authors:	Zhan, G, Xie, W, Zisserman, A
Format:	Conference item
Language:	English
Published:	BMVA Press 2022

_version_	1826314267875344384
author	Zhan, G Xie, W Zisserman, A
author_facet	Zhan, G Xie, W Zisserman, A
author_sort	Zhan, G
collection	OXFORD
description	Detecting occluded objects still remains a challenge for state-of-the-art object detectors. The objective of this work is to improve the detection for such objects, and thereby improve the overall performance of a modern object detector. To this end we make the following four contributions: (1) We propose a simple `plugin' module for the detection head of two-stage object detectors to improve the recall of partially occluded objects. The module predicts a tri-layer of segmentation masks for the target object, the occluder and the occludee, and by doing so is able to better predict the mask of the target object. (2) We propose a scalable pipeline for generating training data for the module by using amodal completion of existing object detection and instance segmentation training datasets to establish occlusion relationships. (3) We also establish a COCO evaluation dataset to measure the recall performance of partially occluded and separated objects. (4) We show that the plugin module inserted into a two-stage detector can boost the performance significantly, by only fine-tuning the detection head, and with additional improvements if the entire architecture is fine-tuned. COCO results are reported for Mask R-CNN with Swin-T or Swin-S backbones, and Cascade Mask R-CNN with a Swin-B backbone.
first_indexed	2024-03-07T07:29:20Z
format	Conference item
id	oxford-uuid:ba561358-f8e2-4715-bdbf-1b3939b9a97b
institution	University of Oxford
language	English
last_indexed	2024-09-25T04:29:58Z
publishDate	2022
publisher	BMVA Press
record_format	dspace
spelling	oxford-uuid:ba561358-f8e2-4715-bdbf-1b3939b9a97b2024-08-27T10:17:16ZA tri-layer plugin to improve occluded detectionConference itemhttp://purl.org/coar/resource_type/c_5794uuid:ba561358-f8e2-4715-bdbf-1b3939b9a97bEnglishSymplectic ElementsBMVA Press2022Zhan, GXie, WZisserman, ADetecting occluded objects still remains a challenge for state-of-the-art object detectors. The objective of this work is to improve the detection for such objects, and thereby improve the overall performance of a modern object detector. To this end we make the following four contributions: (1) We propose a simple `plugin' module for the detection head of two-stage object detectors to improve the recall of partially occluded objects. The module predicts a tri-layer of segmentation masks for the target object, the occluder and the occludee, and by doing so is able to better predict the mask of the target object. (2) We propose a scalable pipeline for generating training data for the module by using amodal completion of existing object detection and instance segmentation training datasets to establish occlusion relationships. (3) We also establish a COCO evaluation dataset to measure the recall performance of partially occluded and separated objects. (4) We show that the plugin module inserted into a two-stage detector can boost the performance significantly, by only fine-tuning the detection head, and with additional improvements if the entire architecture is fine-tuned. COCO results are reported for Mask R-CNN with Swin-T or Swin-S backbones, and Cascade Mask R-CNN with a Swin-B backbone.
spellingShingle	Zhan, G Xie, W Zisserman, A A tri-layer plugin to improve occluded detection
title	A tri-layer plugin to improve occluded detection
title_full	A tri-layer plugin to improve occluded detection
title_fullStr	A tri-layer plugin to improve occluded detection
title_full_unstemmed	A tri-layer plugin to improve occluded detection
title_short	A tri-layer plugin to improve occluded detection
title_sort	tri layer plugin to improve occluded detection
work_keys_str_mv	AT zhang atrilayerplugintoimproveoccludeddetection AT xiew atrilayerplugintoimproveoccludeddetection AT zissermana atrilayerplugintoimproveoccludeddetection AT zhang trilayerplugintoimproveoccludeddetection AT xiew trilayerplugintoimproveoccludeddetection AT zissermana trilayerplugintoimproveoccludeddetection

A tri-layer plugin to improve occluded detection

Similar Items