Learning to count objects in images

We propose a new supervised learning framework for visual object counting tasks, such as estimating the number of cells in a microscopic image or the number of humans in surveillance video frames. We focus on the practically-attractive case when the training images are annotated with dots (one dot p...

Full description

Bibliographic Details
Main Authors: Lempitsky, V, Zisserman, A
Format: Conference item
Language:English
Published: Curran Associates 2011
_version_ 1824459128600264704
author Lempitsky, V
Zisserman, A
author_facet Lempitsky, V
Zisserman, A
author_sort Lempitsky, V
collection OXFORD
description We propose a new supervised learning framework for visual object counting tasks, such as estimating the number of cells in a microscopic image or the number of humans in surveillance video frames. We focus on the practically-attractive case when the training images are annotated with dots (one dot per object). Our goal is to accurately estimate the count. However, we evade the hard task of learning to detect and localize individual object instances. Instead, we cast the problem as that of estimating an image density whose integral over any image region gives the count of objects within that region. Learning to infer such density can be formulated as a minimization of a regularized risk quadratic cost function. We introduce a new loss function, which is well-suited for such learning, and at the same time can be computed efficiently via a maximum subarray algorithm. The learning can then be posed as a convex quadratic program solvable with cutting-plane optimization. The proposed framework is very flexible as it can accept any domain-specific visual features. Once trained, our system provides accurate object counts and requires a very small time overhead over the feature extraction step, making it a good candidate for applications involving real-time processing or dealing with huge amount of visual data.
first_indexed 2025-02-19T04:36:52Z
format Conference item
id oxford-uuid:c75861b1-cd39-4929-be13-3b3446a1efe9
institution University of Oxford
language English
last_indexed 2025-02-19T04:36:52Z
publishDate 2011
publisher Curran Associates
record_format dspace
spelling oxford-uuid:c75861b1-cd39-4929-be13-3b3446a1efe92025-01-30T11:14:56ZLearning to count objects in imagesConference itemhttp://purl.org/coar/resource_type/c_5794uuid:c75861b1-cd39-4929-be13-3b3446a1efe9EnglishSymplectic ElementsCurran Associates2011Lempitsky, VZisserman, AWe propose a new supervised learning framework for visual object counting tasks, such as estimating the number of cells in a microscopic image or the number of humans in surveillance video frames. We focus on the practically-attractive case when the training images are annotated with dots (one dot per object). Our goal is to accurately estimate the count. However, we evade the hard task of learning to detect and localize individual object instances. Instead, we cast the problem as that of estimating an image density whose integral over any image region gives the count of objects within that region. Learning to infer such density can be formulated as a minimization of a regularized risk quadratic cost function. We introduce a new loss function, which is well-suited for such learning, and at the same time can be computed efficiently via a maximum subarray algorithm. The learning can then be posed as a convex quadratic program solvable with cutting-plane optimization. The proposed framework is very flexible as it can accept any domain-specific visual features. Once trained, our system provides accurate object counts and requires a very small time overhead over the feature extraction step, making it a good candidate for applications involving real-time processing or dealing with huge amount of visual data.
spellingShingle Lempitsky, V
Zisserman, A
Learning to count objects in images
title Learning to count objects in images
title_full Learning to count objects in images
title_fullStr Learning to count objects in images
title_full_unstemmed Learning to count objects in images
title_short Learning to count objects in images
title_sort learning to count objects in images
work_keys_str_mv AT lempitskyv learningtocountobjectsinimages
AT zissermana learningtocountobjectsinimages