Humanising GrabCut: learning to segment humans using the Kinect

The Kinect provides an opportunity to collect large quantities of training data for visual learning algorithms relatively effortlessly. To this end we investigate learning to automatically segment humans from cluttered images (without depth information) given a bounding box. For this algorithm, obta...

Full description

Bibliographic Details
Main Authors:	Gulshan, V, Lempitsky, V, Zisserman, A
Format:	Conference item
Language:	English
Published:	IEEE 2012

_version_	1824458761046065152
author	Gulshan, V Lempitsky, V Zisserman, A
author_facet	Gulshan, V Lempitsky, V Zisserman, A
author_sort	Gulshan, V
collection	OXFORD
description	The Kinect provides an opportunity to collect large quantities of training data for visual learning algorithms relatively effortlessly. To this end we investigate learning to automatically segment humans from cluttered images (without depth information) given a bounding box. For this algorithm, obtaining a large dataset of images with segmented humans is crucial as it enables the possible variations in human appearances and backgrounds to be learnt. We show that a large dataset of roughly 3400 humans can be automatically acquired very cheaply using the Kinect. Segmenting humans is then cast as a learning problem with linear classifiers trained to predict segmentation masks from sparsely coded local HOG descriptors. These classifiers introduce top-down knowledge to obtain a crude segmentation of the human which is then refined using bottom up information from local color models in a Snap-Cut [2] like fashion. The method is quantitatively evaluated on images of humans in cluttered scenes, and a high performance obtained (88:5% overlap score). We also show that the method can be completely automated - segmenting humans given only the images, without requiring a bounding box, and compare with a previous state of the art method.
first_indexed	2025-02-19T04:31:01Z
format	Conference item
id	oxford-uuid:0f0cb72e-ddd1-4e46-bd0f-b5dc20de2423
institution	University of Oxford
language	English
last_indexed	2025-02-19T04:31:01Z
publishDate	2012
publisher	IEEE
record_format	dspace
spelling	oxford-uuid:0f0cb72e-ddd1-4e46-bd0f-b5dc20de24232025-01-09T12:30:13ZHumanising GrabCut: learning to segment humans using the KinectConference itemhttp://purl.org/coar/resource_type/c_5794uuid:0f0cb72e-ddd1-4e46-bd0f-b5dc20de2423EnglishSymplectic ElementsIEEE2012Gulshan, VLempitsky, VZisserman, AThe Kinect provides an opportunity to collect large quantities of training data for visual learning algorithms relatively effortlessly. To this end we investigate learning to automatically segment humans from cluttered images (without depth information) given a bounding box. For this algorithm, obtaining a large dataset of images with segmented humans is crucial as it enables the possible variations in human appearances and backgrounds to be learnt. We show that a large dataset of roughly 3400 humans can be automatically acquired very cheaply using the Kinect. Segmenting humans is then cast as a learning problem with linear classifiers trained to predict segmentation masks from sparsely coded local HOG descriptors. These classifiers introduce top-down knowledge to obtain a crude segmentation of the human which is then refined using bottom up information from local color models in a Snap-Cut [2] like fashion. The method is quantitatively evaluated on images of humans in cluttered scenes, and a high performance obtained (88:5% overlap score). We also show that the method can be completely automated - segmenting humans given only the images, without requiring a bounding box, and compare with a previous state of the art method.
spellingShingle	Gulshan, V Lempitsky, V Zisserman, A Humanising GrabCut: learning to segment humans using the Kinect
title	Humanising GrabCut: learning to segment humans using the Kinect
title_full	Humanising GrabCut: learning to segment humans using the Kinect
title_fullStr	Humanising GrabCut: learning to segment humans using the Kinect
title_full_unstemmed	Humanising GrabCut: learning to segment humans using the Kinect
title_short	Humanising GrabCut: learning to segment humans using the Kinect
title_sort	humanising grabcut learning to segment humans using the kinect
work_keys_str_mv	AT gulshanv humanisinggrabcutlearningtosegmenthumansusingthekinect AT lempitskyv humanisinggrabcutlearningtosegmenthumansusingthekinect AT zissermana humanisinggrabcutlearningtosegmenthumansusingthekinect

Humanising GrabCut: learning to segment humans using the Kinect

Similar Items