Structured learning of human interactions in TV shows

The objective of this work is recognition and spatiotemporal localization of two-person interactions in video. Our approach is person-centric. As a first stage we track all upper bodies and heads in a video using a tracking-by-detection approach that combines detections with KLT tracking and clique...

Description complète

Détails bibliographiques
Auteurs principaux: Patron-Perez, A, Marszalek, M, Reid, I, Zisserman, A
Format: Journal article
Langue:English
Publié: IEEE 2012

Documents similaires