Straight to Shapes: Real-time Detection of Encoded Shapes

Current object detection approaches predict bounding boxes that provide little instance-specific information beyond location, scale and aspect ratio. In this work, we propose to regress directly to objects’ shapes in addition to their bounding boxes and categories. It is crucial to find an appropria...

Descripció completa

Dades bibliogràfiques
Autors principals: Jetley, S, Sapienza, M, Golodetz, S, Torr, P
Format: Conference item
Publicat: Institute of Electrical and Electronics Engineers 2017
_version_ 1826262241379352576
author Jetley, S
Sapienza, M
Golodetz, S
Torr, P
author_facet Jetley, S
Sapienza, M
Golodetz, S
Torr, P
author_sort Jetley, S
collection OXFORD
description Current object detection approaches predict bounding boxes that provide little instance-specific information beyond location, scale and aspect ratio. In this work, we propose to regress directly to objects’ shapes in addition to their bounding boxes and categories. It is crucial to find an appropriate shape representation that is compact and decodable, and in which objects can be compared for higher order concepts such as view similarity, pose variation and occlusion. To achieve this, we use a denoising convolutional auto-encoder to learn a low-dimensional shape embedding space. We place the decoder network after a fast end-to end deep convolutional network that is trained to regress directly to the shape vectors provided by the auto-encoder. This yields what to the best of our knowledge is the first real-time shape prediction network, running at 35 FPS on a high-end desktop. With higher-order shape reasoning well integrated into the network pipeline, the network shows the useful practical quality of generalising to unseen categories that are similar to the ones in the training set, something that most existing approaches fail to handle.
first_indexed 2024-03-06T19:33:18Z
format Conference item
id oxford-uuid:1e312239-8533-4670-a2f6-ca1b4b1082f6
institution University of Oxford
last_indexed 2024-03-06T19:33:18Z
publishDate 2017
publisher Institute of Electrical and Electronics Engineers
record_format dspace
spelling oxford-uuid:1e312239-8533-4670-a2f6-ca1b4b1082f62022-03-26T11:15:01ZStraight to Shapes: Real-time Detection of Encoded ShapesConference itemhttp://purl.org/coar/resource_type/c_1843uuid:1e312239-8533-4670-a2f6-ca1b4b1082f6Symplectic Elements at OxfordInstitute of Electrical and Electronics Engineers2017Jetley, SSapienza, MGolodetz, STorr, PCurrent object detection approaches predict bounding boxes that provide little instance-specific information beyond location, scale and aspect ratio. In this work, we propose to regress directly to objects’ shapes in addition to their bounding boxes and categories. It is crucial to find an appropriate shape representation that is compact and decodable, and in which objects can be compared for higher order concepts such as view similarity, pose variation and occlusion. To achieve this, we use a denoising convolutional auto-encoder to learn a low-dimensional shape embedding space. We place the decoder network after a fast end-to end deep convolutional network that is trained to regress directly to the shape vectors provided by the auto-encoder. This yields what to the best of our knowledge is the first real-time shape prediction network, running at 35 FPS on a high-end desktop. With higher-order shape reasoning well integrated into the network pipeline, the network shows the useful practical quality of generalising to unseen categories that are similar to the ones in the training set, something that most existing approaches fail to handle.
spellingShingle Jetley, S
Sapienza, M
Golodetz, S
Torr, P
Straight to Shapes: Real-time Detection of Encoded Shapes
title Straight to Shapes: Real-time Detection of Encoded Shapes
title_full Straight to Shapes: Real-time Detection of Encoded Shapes
title_fullStr Straight to Shapes: Real-time Detection of Encoded Shapes
title_full_unstemmed Straight to Shapes: Real-time Detection of Encoded Shapes
title_short Straight to Shapes: Real-time Detection of Encoded Shapes
title_sort straight to shapes real time detection of encoded shapes
work_keys_str_mv AT jetleys straighttoshapesrealtimedetectionofencodedshapes
AT sapienzam straighttoshapesrealtimedetectionofencodedshapes
AT golodetzs straighttoshapesrealtimedetectionofencodedshapes
AT torrp straighttoshapesrealtimedetectionofencodedshapes