HeadLock : wide-range head pose estimation for low resolution video

Thesis (S.M.)--Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, February 2008.

Bibliographic Details
Main Author: DeCamp, Philip (Philip James)
Other Authors: Deb Roy.
Format: Thesis
Language: eng
Published: Massachusetts Institute of Technology 2008
Subjects: Architecture. Program in Media Arts and Sciences.
Online Access: http://hdl.handle.net/1721.1/42411
_version_ 1826200253860151296
author DeCamp, Philip (Philip James)
author2 Deb Roy.
author_facet Deb Roy.
DeCamp, Philip (Philip James)
author_sort DeCamp, Philip (Philip James)
collection MIT
description Thesis (S.M.)--Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, February 2008.
first_indexed 2024-09-23T11:33:38Z
format Thesis
id mit-1721.1/42411
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T11:33:38Z
publishDate 2008
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/42411
2019-04-10T21:59:01Z
HeadLock : wide-range head pose estimation for low resolution video
Wide-range head pose estimation for low resolution video
DeCamp, Philip (Philip James)
Deb Roy.
Massachusetts Institute of Technology. Dept. of Architecture. Program in Media Arts and Sciences.
Massachusetts Institute of Technology. Dept. of Architecture. Program in Media Arts and Sciences.
Architecture. Program in Media Arts and Sciences.
Thesis (S.M.)--Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, February 2008.
Includes bibliographical references (p. 85-87).
This thesis focuses on data mining technologies to extract head pose information from low resolution video recordings. Head pose, as an approximation of gaze direction, is a key indicator of human behavior and interaction. Extracting head pose information from video recordings is a labor intensive endeavor that severely limits the feasibility of using large video corpora to perform tasks that require analysis of human behavior. HeadLock is a novel head pose annotation and tracking tool. Pose annotation is formulated as a semiautomatic process in which a human annotator is aided by computationally generated head pose estimates, significantly reducing the human effort required to accurately annotate video recordings. HeadLock has been designed to perform head pose tracking on video from overhead, wide-angle cameras. The head pose estimation system used by HeadLock can perform pose estimation to arbitrary precision on images that reveal only the top or back of a head. This system takes a 3D model-based approach in which heads are modeled as 3D surfaces covered with localized features. The set of features used can be reliably extracted from both hair and skin regions at any resolution, providing better performance for images that may contain small facial regions and no discernible facial features.
HeadLock is evaluated on video recorded for the Human Speechome Project (HSP), a research initiative to study human language development by analyzing longitudinal audio-video recordings of a developing child. Results indicate that HeadLock may enable annotation of head pose at ten times the speed of a manual approach. In addition to head tracking, this thesis describes the data collection and data management systems that have been developed for HSP, providing a comprehensive example of how very large corpora of video recordings may be used to research human development, health and behavior.
by Philip DeCamp.
S.M.
2008-09-03T15:35:00Z
2008-09-03T15:35:00Z
2007
2008
Thesis
http://hdl.handle.net/1721.1/42411
237210074
eng
M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.
http://dspace.mit.edu/handle/1721.1/7582
87 p.
application/pdf
Massachusetts Institute of Technology
spellingShingle Architecture. Program in Media Arts and Sciences.
DeCamp, Philip (Philip James)
HeadLock : wide-range head pose estimation for low resolution video
title HeadLock : wide-range head pose estimation for low resolution video
title_full HeadLock : wide-range head pose estimation for low resolution video
title_fullStr HeadLock : wide-range head pose estimation for low resolution video
title_full_unstemmed HeadLock : wide-range head pose estimation for low resolution video
title_short HeadLock : wide-range head pose estimation for low resolution video
title_sort headlock wide range head pose estimation for low resolution video
topic Architecture. Program in Media Arts and Sciences.
url http://hdl.handle.net/1721.1/42411
work_keys_str_mv AT decampphilipphilipjames headlockwiderangeheadposeestimationforlowresolutionvideo
AT decampphilipphilipjames widerangeheadposeestimationforlowresolutionvideo