Learnable PINs: Cross-modal embeddings for person identity

We propose and investigate an identity sensitive joint embedding of face and voice. Such an embedding enables cross-modal retrieval from voice to face and from face to voice. We make the following four contributions: first, we show that the embedding can be learnt from videos of talking faces, witho...

وصف كامل

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون: Nagrani, A, Albanie, S, Zisserman, A
التنسيق: Conference item
منشور في: Springer 2018