Structured learning of human interactions in TV shows
The objective of this work is recognition and spatiotemporal localization of two-person interactions in video. Our approach is person-centric. As a first stage we track all upper bodies and heads in a video using a tracking-by-detection approach that combines detections with KLT tracking and clique...
Main Authors: | , , , |
---|---|
Format: | Journal article |
Language: | English |
Published: | IEEE, 2012 |