Names and faces in the news

We show quite good face clustering is possible for a dataset of inaccurately and ambiguously labelled face images. Our dataset is 44,773 face images, obtained by applying a face finder to approximately half a million captioned news images. This dataset is more realistic than usual face recognition d...

Full description

Bibliographic Details
Main Authors: Berg, T, Berg, A, Edwards, J, Maire, M, White, R, Teh, Y, Learned-Miller, E, Forsyth, D, Society, IEEEC
Format: Journal article
Language:English
Published: 2004
_version_ 1797056752411213824
author Berg, T
Berg, A
Edwards, J
Maire, M
White, R
Teh, Y
Learned-Miller, E
Forsyth, D
Society, IEEEC
author_facet Berg, T
Berg, A
Edwards, J
Maire, M
White, R
Teh, Y
Learned-Miller, E
Forsyth, D
Society, IEEEC
author_sort Berg, T
collection OXFORD
description We show quite good face clustering is possible for a dataset of inaccurately and ambiguously labelled face images. Our dataset is 44,773 face images, obtained by applying a face finder to approximately half a million captioned news images. This dataset is more realistic than usual face recognition datasets, because it contains faces captured "in the wild" in a variety of configurations with respect to the camera, taking a variety of expressions, and under illumination of widely varying color. Each face image is associated with a set of names, automatically extracted from the associated caption. Many, but not all such sets contain the correct name. We cluster face images in appropriate discriminant coordinates. We use a clustering procedure to break ambiguities in labelling and identify incorrectly labelled faces. A merging procedure then identifies variants of names that refer to the same individual. The resulting representation can be used to label faces in news images or to organize news pictures by individuals present. An alternative view of our procedure is as a process that cleans up noisy supervised data. We demonstrate how to use entropy measures to evaluate such procedures.
first_indexed 2024-03-06T19:27:03Z
format Journal article
id oxford-uuid:1c20e9bb-a516-483c-ba4a-550fcc839a6b
institution University of Oxford
language English
last_indexed 2024-03-06T19:27:03Z
publishDate 2004
record_format dspace
spelling oxford-uuid:1c20e9bb-a516-483c-ba4a-550fcc839a6b2022-03-26T11:04:01ZNames and faces in the newsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:1c20e9bb-a516-483c-ba4a-550fcc839a6bEnglishSymplectic Elements at Oxford2004Berg, TBerg, AEdwards, JMaire, MWhite, RTeh, YLearned-Miller, EForsyth, DSociety, IEEECWe show quite good face clustering is possible for a dataset of inaccurately and ambiguously labelled face images. Our dataset is 44,773 face images, obtained by applying a face finder to approximately half a million captioned news images. This dataset is more realistic than usual face recognition datasets, because it contains faces captured "in the wild" in a variety of configurations with respect to the camera, taking a variety of expressions, and under illumination of widely varying color. Each face image is associated with a set of names, automatically extracted from the associated caption. Many, but not all such sets contain the correct name. We cluster face images in appropriate discriminant coordinates. We use a clustering procedure to break ambiguities in labelling and identify incorrectly labelled faces. A merging procedure then identifies variants of names that refer to the same individual. The resulting representation can be used to label faces in news images or to organize news pictures by individuals present. An alternative view of our procedure is as a process that cleans up noisy supervised data. We demonstrate how to use entropy measures to evaluate such procedures.
spellingShingle Berg, T
Berg, A
Edwards, J
Maire, M
White, R
Teh, Y
Learned-Miller, E
Forsyth, D
Society, IEEEC
Names and faces in the news
title Names and faces in the news
title_full Names and faces in the news
title_fullStr Names and faces in the news
title_full_unstemmed Names and faces in the news
title_short Names and faces in the news
title_sort names and faces in the news
work_keys_str_mv AT bergt namesandfacesinthenews
AT berga namesandfacesinthenews
AT edwardsj namesandfacesinthenews
AT mairem namesandfacesinthenews
AT whiter namesandfacesinthenews
AT tehy namesandfacesinthenews
AT learnedmillere namesandfacesinthenews
AT forsythd namesandfacesinthenews
AT societyieeec namesandfacesinthenews