Face, body, voice: video person-clustering with multiple modalities
The objective of this work is person-clustering in videos – grouping characters according to their identity. Previous methods focus on the narrower task of face-clustering, and for the most part ignore other cues such as the person’s voice, their overall appearance (hair, clothes, posture), and the...
Main Authors: | Brown, A, Kalogeiton, V, Zisserman, A |
---|---|
Format: | Conference item |
Language: | English |
Published: |
IEEE
2021
|
Similar Items
-
Constrained video face clustering using 1NN relations
by: Kalogeiton, V, et al.
Published: (2020) -
Seeing voices and hearing faces: Cross-modal biometric matching
by: Nagrani, A, et al.
Published: (2018) -
LAEO-Net++: revisiting people looking at each other in videos
by: Marin-Jimenez, MJ, et al.
Published: (2020) -
LAEO-Net: Revisiting people looking at each other in videos
by: Marin-Jimenez, M, et al.
Published: (2020) -
Modality-specific brain representations during automatic processing of face, voice and body expressions
by: Maarten Vaessen, et al.
Published: (2023-10-01)