Structured learning of human interactions in TV shows

The objective of this work is recognition and spatiotemporal localization of two-person interactions in video. Our approach is person-centric. As a first stage we track all upper bodies and heads in a video using a tracking-by-detection approach that combines detections with KLT tracking and clique...

全面介紹

書目詳細資料
Main Authors: Patron-Perez, A, Marszalek, M, Reid, I, Zisserman, A
格式: Journal article
語言:English
出版: IEEE 2012