Visual recognition of multi-agent action
Thesis (Ph.D.)--Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, 1999.
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | eng |
Published: |
Massachusetts Institute of Technology
2005
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/9374 |
_version_ | 1826216459052777472 |
---|---|
author | Intille, Stephen S. (Stephen Sean) |
author2 | Aaron F. Bobick. |
author_facet | Aaron F. Bobick. Intille, Stephen S. (Stephen Sean) |
author_sort | Intille, Stephen S. (Stephen Sean) |
collection | MIT |
description | Thesis (Ph.D.)--Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, 1999. |
first_indexed | 2024-09-23T16:48:01Z |
format | Thesis |
id | mit-1721.1/9374 |
institution | Massachusetts Institute of Technology |
language | eng |
last_indexed | 2024-09-23T16:48:01Z |
publishDate | 2005 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/93742019-04-11T13:39:04Z Visual recognition of multi-agent action Intille, Stephen S. (Stephen Sean) Aaron F. Bobick. Massachusetts Institute of Technology. Dept. of Architecture. Program in Media Arts and Sciences. Massachusetts Institute of Technology. Dept. of Architecture. Program in Media Arts and Sciences. Architecture. Program in Media Arts and Sciences. Thesis (Ph.D.)--Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, 1999. Includes bibliographical references (p. 167-184). Developing computer vision sensing systems that work robustly in everyday environments will require that the systems can recognize structured interaction between people and objects in the world. This document presents a new theory for the representation and recognition of coordinated multi-agent action from noisy perceptual data. The thesis of this work is as follows: highly structured, multi-agent action can be recognized from noisy perceptual data using visually grounded goal-based primitives and low-order temporal relationships that are integrated in a probabilistic framework. The theory is developed and evaluated by examining general characteristics of multi-agent action, analyzing tradeoffs involved when selecting a representation for multi-agent action recognition, and constructing a system to recognize multi-agent action for a real task from noisy data. The representation, which is motivated by work in model-based object recognition and probabilistic plan recognition, makes four principal assumptions: (1) the goals of individual agents are natural atomic representational units for specifying the temporal relationships between agents engaged in group activities, (2) a high-level description of temporal structure of the action using a small set of low-order temporal and logical constraints is adequate for representing the relationships between the agent goals for highly structured, multi-agent action recognition, (3) Bayesian networks provide a suitable mechanism for integrating multiple sources of uncertain visual perceptual feature evidence, and (4) an automatically generated Bayesian network can be used to combine uncertain temporal information and compute the likelihood that a set of object trajectory data is a particular multi-agent action. The recognition algorithm is tested using a database of American football play descriptions. A system is described that can recognize single-agent and multi-agent actions in this domain given noisy trajectories of object movements. The strengths and limitations of the recognition system are discussed and compared with other multi-agent recognition algorithms. by Stephen Sean Intille. Ph.D. 2005-08-22T20:41:53Z 2005-08-22T20:41:53Z 1999 1999 Thesis http://hdl.handle.net/1721.1/9374 44815054 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 237 p. 23167653 bytes 23167410 bytes application/pdf application/pdf application/pdf Massachusetts Institute of Technology |
spellingShingle | Architecture. Program in Media Arts and Sciences. Intille, Stephen S. (Stephen Sean) Visual recognition of multi-agent action |
title | Visual recognition of multi-agent action |
title_full | Visual recognition of multi-agent action |
title_fullStr | Visual recognition of multi-agent action |
title_full_unstemmed | Visual recognition of multi-agent action |
title_short | Visual recognition of multi-agent action |
title_sort | visual recognition of multi agent action |
topic | Architecture. Program in Media Arts and Sciences. |
url | http://hdl.handle.net/1721.1/9374 |
work_keys_str_mv | AT intillestephensstephensean visualrecognitionofmultiagentaction |