Structured output regression for detection with partial truncation

We develop a structured output model for object category detection that explicitly accounts for alignment, multiple aspects and partial truncation in both training and inference. The model is formulated as large margin learning with latent variables and slack rescaling, and both training and inferen...

Full description

Bibliographic Details
Main Authors: Vedaldi, A, Zisserman, A
Format: Journal article
Language:English
Published: 2009
_version_ 1826259717848039424
author Vedaldi, A
Zisserman, A
author_facet Vedaldi, A
Zisserman, A
author_sort Vedaldi, A
collection OXFORD
description We develop a structured output model for object category detection that explicitly accounts for alignment, multiple aspects and partial truncation in both training and inference. The model is formulated as large margin learning with latent variables and slack rescaling, and both training and inference are computationally efficient. We make the following contributions: (i) we note that extending the Structured Output Regression formulation of Blaschko and Lampert [1] to include a bias term significantly improves performance; (ii) that alignment (to account for small rotations and anisotropic scalings) can be included as a latent variable and efficiently determined and implemented; (iii) that the latent variable extends to multiple aspects (e.g. left facing, right facing, front) with the same formulation; and (iv), most significantly for performance, that truncated and truncated instances can be included in both training and inference with an explicit truncation mask. We demonstrate the method by training and testing on the PASCAL VOC 2007 data set - training includes the truncated examples, and in testing object instances are detected at multiple scales, alignments, and with significant truncations.
first_indexed 2024-03-06T18:54:13Z
format Journal article
id oxford-uuid:1141203e-5718-4380-b532-7791941ea15c
institution University of Oxford
language English
last_indexed 2024-03-06T18:54:13Z
publishDate 2009
record_format dspace
spelling oxford-uuid:1141203e-5718-4380-b532-7791941ea15c2022-03-26T10:01:17ZStructured output regression for detection with partial truncationJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:1141203e-5718-4380-b532-7791941ea15cEnglishSymplectic Elements at Oxford2009Vedaldi, AZisserman, AWe develop a structured output model for object category detection that explicitly accounts for alignment, multiple aspects and partial truncation in both training and inference. The model is formulated as large margin learning with latent variables and slack rescaling, and both training and inference are computationally efficient. We make the following contributions: (i) we note that extending the Structured Output Regression formulation of Blaschko and Lampert [1] to include a bias term significantly improves performance; (ii) that alignment (to account for small rotations and anisotropic scalings) can be included as a latent variable and efficiently determined and implemented; (iii) that the latent variable extends to multiple aspects (e.g. left facing, right facing, front) with the same formulation; and (iv), most significantly for performance, that truncated and truncated instances can be included in both training and inference with an explicit truncation mask. We demonstrate the method by training and testing on the PASCAL VOC 2007 data set - training includes the truncated examples, and in testing object instances are detected at multiple scales, alignments, and with significant truncations.
spellingShingle Vedaldi, A
Zisserman, A
Structured output regression for detection with partial truncation
title Structured output regression for detection with partial truncation
title_full Structured output regression for detection with partial truncation
title_fullStr Structured output regression for detection with partial truncation
title_full_unstemmed Structured output regression for detection with partial truncation
title_short Structured output regression for detection with partial truncation
title_sort structured output regression for detection with partial truncation
work_keys_str_mv AT vedaldia structuredoutputregressionfordetectionwithpartialtruncation
AT zissermana structuredoutputregressionfordetectionwithpartialtruncation