Taxonomic multi-class prediction and person layout using efficient structured ranking

In computer vision efficient multi-class classification is becoming a key problem as the field develops and the number of object classes to be identified increases. Often objects might have some sort of structure such as a taxonomy in which the mis-classification score for object classes close by, u...

Descripción completa

Detalles Bibliográficos
Autores principales: Mittal, A, Blaschko, MB, Zisserman, A, Torr, PHS
Formato: Conference item
Lenguaje:English
Publicado: Springer 2012
_version_ 1826314366852530176
author Mittal, A
Blaschko, MB
Zisserman, A
Torr, PHS
author_facet Mittal, A
Blaschko, MB
Zisserman, A
Torr, PHS
author_sort Mittal, A
collection OXFORD
description In computer vision efficient multi-class classification is becoming a key problem as the field develops and the number of object classes to be identified increases. Often objects might have some sort of structure such as a taxonomy in which the mis-classification score for object classes close by, using tree distance within the taxonomy, should be less than for those far apart. This is an example of multi-class classification in which the loss function has a special structure. Another example in vision is for the ubiquitous pictorial structure or parts based model. In this case we would like the mis-classification score to be proportional to the number of parts misclassified. <br> It transpires both of these are examples of structured output ranking problems. However, so far no efficient large scale algorithm for this problem has been demonstrated. In this work we propose an algorithm for structured output ranking that can be trained in a time linear in the number of samples under a mild assumption common to many computer vision problems: that the loss function can be discretized into a small number of values. <br> We show the feasibility of structured ranking on these two core computer vision problems and demonstrate a consistent and substantial improvement over competing techniques. Aside from this, we also achieve state-of-the art results for the PASCAL VOC human layout problem.
first_indexed 2024-09-25T04:31:26Z
format Conference item
id oxford-uuid:24f54ebd-0be0-42e6-a026-d4d8b46acdf1
institution University of Oxford
language English
last_indexed 2024-09-25T04:31:26Z
publishDate 2012
publisher Springer
record_format dspace
spelling oxford-uuid:24f54ebd-0be0-42e6-a026-d4d8b46acdf12024-09-03T16:15:18ZTaxonomic multi-class prediction and person layout using efficient structured rankingConference itemhttp://purl.org/coar/resource_type/c_5794uuid:24f54ebd-0be0-42e6-a026-d4d8b46acdf1EnglishSymplectic ElementsSpringer2012Mittal, ABlaschko, MBZisserman, ATorr, PHSIn computer vision efficient multi-class classification is becoming a key problem as the field develops and the number of object classes to be identified increases. Often objects might have some sort of structure such as a taxonomy in which the mis-classification score for object classes close by, using tree distance within the taxonomy, should be less than for those far apart. This is an example of multi-class classification in which the loss function has a special structure. Another example in vision is for the ubiquitous pictorial structure or parts based model. In this case we would like the mis-classification score to be proportional to the number of parts misclassified. <br> It transpires both of these are examples of structured output ranking problems. However, so far no efficient large scale algorithm for this problem has been demonstrated. In this work we propose an algorithm for structured output ranking that can be trained in a time linear in the number of samples under a mild assumption common to many computer vision problems: that the loss function can be discretized into a small number of values. <br> We show the feasibility of structured ranking on these two core computer vision problems and demonstrate a consistent and substantial improvement over competing techniques. Aside from this, we also achieve state-of-the art results for the PASCAL VOC human layout problem.
spellingShingle Mittal, A
Blaschko, MB
Zisserman, A
Torr, PHS
Taxonomic multi-class prediction and person layout using efficient structured ranking
title Taxonomic multi-class prediction and person layout using efficient structured ranking
title_full Taxonomic multi-class prediction and person layout using efficient structured ranking
title_fullStr Taxonomic multi-class prediction and person layout using efficient structured ranking
title_full_unstemmed Taxonomic multi-class prediction and person layout using efficient structured ranking
title_short Taxonomic multi-class prediction and person layout using efficient structured ranking
title_sort taxonomic multi class prediction and person layout using efficient structured ranking
work_keys_str_mv AT mittala taxonomicmulticlasspredictionandpersonlayoutusingefficientstructuredranking
AT blaschkomb taxonomicmulticlasspredictionandpersonlayoutusingefficientstructuredranking
AT zissermana taxonomicmulticlasspredictionandpersonlayoutusingefficientstructuredranking
AT torrphs taxonomicmulticlasspredictionandpersonlayoutusingefficientstructuredranking