Audio-Visual Overlapped Speech Detection for Spontaneous Distant Speech
Although advances in deep learning have brought remarkable improvements to Overlapped Speech Detection (OSD), the performance in far-field environments is still limited owing to the lack of real-world overlapped speech and a low signal-to-noise ratio. In this paper, we present an end-to-end audiovis...
Main Authors: | Minyoung Kyoung, Hyungbae Jeon, Kiyoung Park |
---|---|
Format: | Article |
Language: | English |
Published: | IEEE 2023-01-01 |
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10064301/ |
Similar Items
- ANALYSIS OF MULTIMODAL FUSION TECHNIQUES FOR AUDIO-VISUAL SPEECH RECOGNITION
  by: D.V. Ivanko, et al.
  Published: (2016-05-01)
- Noise-Robust Multimodal Audio-Visual Speech Recognition System for Speech-Based Interaction Applications
  by: Sanghun Jeon, et al.
  Published: (2022-10-01)
- Multimodal Sensor-Input Architecture with Deep Learning for Audio-Visual Speech Recognition in Wild
  by: Yibo He, et al.
  Published: (2023-02-01)
- Separation of Audio-Visual Speech Sources: A New Approach Exploiting the Audio-Visual Coherence of Speech Stimuli
  by: Sodoyer David, et al.
  Published: (2002-01-01)
- TIMIT-TTS: A Text-to-Speech Dataset for Multimodal Synthetic Media Detection
  by: Davide Salvi, et al.
  Published: (2023-01-01)