Learning invariant representations and applications to face verification

One approach to computer object recognition and modeling the brain's ventral stream involves unsupervised learning of representations that are invariant to common transformations. However, applications of these ideas have usually been limited to 2D affine transformations, e.g., translation and...

Full description

Bibliographic Details
Main Authors:	Liao, Qianli, Leibo, Joel Z., Poggio, Tomaso A.
Other Authors:	Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences
Format:	Article
Language:	en_US
Published:	Neural Information Processing Systems Foundation 2014
Online Access:	http://hdl.handle.net/1721.1/92318 https://orcid.org/0000-0002-3153-916X https://orcid.org/0000-0002-3944-0455 https://orcid.org/0000-0003-0076-621X

_version_	1826209121293041664
author	Liao, Qianli Leibo, Joel Z. Poggio, Tomaso A.
author2	Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences
author_facet	Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences Liao, Qianli Leibo, Joel Z. Poggio, Tomaso A.
author_sort	Liao, Qianli
collection	MIT
description	One approach to computer object recognition and modeling the brain's ventral stream involves unsupervised learning of representations that are invariant to common transformations. However, applications of these ideas have usually been limited to 2D affine transformations, e.g., translation and scaling, since they are easiest to solve via convolution. In accord with a recent theory of transformation-invariance, we propose a model that, while capturing other common convolutional networks as special cases, can also be used with arbitrary identity-preserving transformations. The model's wiring can be learned from videos of transforming objects---or any other grouping of images into sets by their depicted object. Through a series of successively more complex empirical tests, we study the invariance/discriminability properties of this model with respect to different transformations. First, we empirically confirm theoretical predictions for the case of 2D affine transformations. Next, we apply the model to non-affine transformations: as expected, it performs well on face verification tasks requiring invariance to the relatively smooth transformations of 3D rotation-in-depth and changes in illumination direction. Surprisingly, it can also tolerate clutter transformations'' which map an image of a face on one background to an image of the same face on a different background. Motivated by these empirical findings, we tested the same model on face verification benchmark tasks from the computer vision literature: Labeled Faces in the Wild, PubFig and a new dataset we gathered---achieving strong performance in these highly unconstrained cases as well."
first_indexed	2024-09-23T14:17:53Z
format	Article
id	mit-1721.1/92318
institution	Massachusetts Institute of Technology
language	en_US
last_indexed	2024-09-23T14:17:53Z
publishDate	2014
publisher	Neural Information Processing Systems Foundation
record_format	dspace
spelling	mit-1721.1/923182022-10-01T20:25:37Z Learning invariant representations and applications to face verification Liao, Qianli Leibo, Joel Z. Poggio, Tomaso A. Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science McGovern Institute for Brain Research at MIT Liao, Qianli Leibo, Joel Z. Poggio, Tomaso A. One approach to computer object recognition and modeling the brain's ventral stream involves unsupervised learning of representations that are invariant to common transformations. However, applications of these ideas have usually been limited to 2D affine transformations, e.g., translation and scaling, since they are easiest to solve via convolution. In accord with a recent theory of transformation-invariance, we propose a model that, while capturing other common convolutional networks as special cases, can also be used with arbitrary identity-preserving transformations. The model's wiring can be learned from videos of transforming objects---or any other grouping of images into sets by their depicted object. Through a series of successively more complex empirical tests, we study the invariance/discriminability properties of this model with respect to different transformations. First, we empirically confirm theoretical predictions for the case of 2D affine transformations. Next, we apply the model to non-affine transformations: as expected, it performs well on face verification tasks requiring invariance to the relatively smooth transformations of 3D rotation-in-depth and changes in illumination direction. Surprisingly, it can also tolerate clutter transformations'' which map an image of a face on one background to an image of the same face on a different background. Motivated by these empirical findings, we tested the same model on face verification benchmark tasks from the computer vision literature: Labeled Faces in the Wild, PubFig and a new dataset we gathered---achieving strong performance in these highly unconstrained cases as well." 2014-12-16T15:01:38Z 2014-12-16T15:01:38Z 2013 Article http://purl.org/eprint/type/ConferencePaper 1049-5258 http://hdl.handle.net/1721.1/92318 Liao, Qianli, Joel Z. Leibo, and Tomaso Poggio. "Learning invariant representations and applications to face verification." Advances in Neural Information Processing Systems 26 (NIPS 2013). https://orcid.org/0000-0002-3153-916X https://orcid.org/0000-0002-3944-0455 https://orcid.org/0000-0003-0076-621X en_US Advances in Neural Information Processing Systems (NIPS) Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Neural Information Processing Systems Foundation MIT Web Domain
spellingShingle	Liao, Qianli Leibo, Joel Z. Poggio, Tomaso A. Learning invariant representations and applications to face verification
title	Learning invariant representations and applications to face verification
title_full	Learning invariant representations and applications to face verification
title_fullStr	Learning invariant representations and applications to face verification
title_full_unstemmed	Learning invariant representations and applications to face verification
title_short	Learning invariant representations and applications to face verification
title_sort	learning invariant representations and applications to face verification
url	http://hdl.handle.net/1721.1/92318 https://orcid.org/0000-0002-3153-916X https://orcid.org/0000-0002-3944-0455 https://orcid.org/0000-0003-0076-621X
work_keys_str_mv	AT liaoqianli learninginvariantrepresentationsandapplicationstofaceverification AT leibojoelz learninginvariantrepresentationsandapplicationstofaceverification AT poggiotomasoa learninginvariantrepresentationsandapplicationstofaceverification

Learning invariant representations and applications to face verification

Similar Items