Object level grouping for video shots


Bibliographic details
Main authors: Sivic, J, Schaffalitzky, F, Zisserman, A
Format: Conference item
Language: English
Published: Springer 2004
Description
Summary: <p>We describe a method for automatically associating image patches from frames of a movie shot into object-level groups. The method employs both the appearance and motion of the patches.</p> <p>There are two areas of innovation: first, affine invariant regions are used to repair short gaps in individual tracks and also to join sets of tracks across occlusions (where many tracks are lost simultaneously); second, a robust affine factorization method is developed which is able to cope with motion degeneracy. This factorization is used to associate tracks into object-level groups.</p> <p>The outcome is that separate parts of an object that are never visible simultaneously in a single frame are associated together. For example, the front and back of a car, or the front and side of a face. In turn this enables object-level matching and recognition throughout a video.</p> <p>We illustrate the method for a number of shots from the feature film ‘Groundhog Day’.</p>
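
The short-gap repair step mentioned in the summary can be illustrated with a small sketch. This is not the authors' procedure: it assumes, purely for illustration, that each track stores its first and last frame plus a single appearance descriptor, and it joins two tracks when the second starts within a few frames of the first ending and their descriptors are close. The names `Track`, `join_tracks`, `max_gap` and `max_dist` are hypothetical.

```python
from dataclasses import dataclass, replace
from math import dist


@dataclass
class Track:
    start: int          # first frame in which the region is tracked
    end: int            # last frame in which the region is tracked
    descriptor: tuple   # hypothetical appearance descriptor of the region


def join_tracks(tracks, max_gap=5, max_dist=0.2):
    """Greedily merge tracks separated by at most max_gap missing frames
    whose descriptors differ by at most max_dist (Euclidean distance)."""
    merged = []
    for t in sorted(tracks, key=lambda tr: tr.start):
        best = None
        for m in merged:
            gap = t.start - m.end - 1          # number of frames with no detection
            if 0 <= gap <= max_gap:
                d = dist(m.descriptor, t.descriptor)
                if d <= max_dist and (best is None or d < best[0]):
                    best = (d, m)
        if best is None:
            merged.append(replace(t))          # copy, so the input is not mutated
        else:
            m = best[1]
            m.end = t.end                      # extend the earlier track
            m.descriptor = t.descriptor        # keep the most recent appearance
    return merged


tracks = [
    Track(0, 10, (0.0, 0.0)),
    Track(13, 20, (0.05, 0.0)),   # similar appearance, restarts after a 2-frame gap
    Track(12, 18, (1.0, 1.0)),    # different appearance, stays separate
]
result = join_tracks(tracks)
```

With these inputs the first two tracks are joined into one spanning frames 0 to 20, while the third remains on its own. Joining tracks across longer occlusions and grouping them by motion, as the summary describes, would need more than this appearance test.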