Character-aware audio-visual subtitling in context

This paper presents an improved framework for character-aware audio-visual subtitling in TV shows. Our approach integrates speech recognition, speaker diarisation, and character recognition, utilising both audio and visual cues. This holistic solution addresses what is said, when it’s said, and who...

Ausführliche Beschreibung

Bibliographische Detailangaben
Hauptverfasser: Huh, J, Zisserman, A
Format: Conference item
Sprache:English
Veröffentlicht: Springer 2024