Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection

Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection

Active speaker detection in videos addresses associating a source face, visible in the video frames, with the underlying speech in the audio modality. The two primary sources of information to derive such a speech-face relationship are i) visual activity and its interaction with the speech signal an...

Full description

Bibliographic Details
Main Authors:	Rahul Sharma, Shrikanth Narayanan
Format:	Article
Language:	English
Published:	IEEE 2023-01-01
Series:	IEEE Open Journal of Signal Processing
Subjects:	Active speaker detection character identity cross-modal speaker recognition
Online Access:	https://ieeexplore.ieee.org/document/10102534/

Similar Items

Speaker Recognition: Progression and challenges
by: Yusra Al-Irahyim, et al.
Published: (2021-09-01)

Speaker Recognition in Uncontrolled Environment: A Review
by: Karamangala Narendra, et al.
Published: (2013-03-01)

A high-performance text-independent speaker identification of Arabic speakers using a CHMM-based approach
by: Hesham Tolba
Published: (2011-03-01)

An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain
by: Driss Khalil, et al.
Published: (2023-10-01)

Speaker Recognition Systems in the Last Decade – A Survey
by: Ahmed M. Ahmed, et al.
Published: (2021-03-01)

Real Time Recognition Of Speakers From Internet Audio Stream
by: Weychan Radoslaw, et al.
Published: (2015-09-01)

Self Attention Networks in Speaker Recognition
by: Pooyan Safari, et al.
Published: (2023-05-01)

Residual Information in Deep Speaker Embedding Architectures
by: Adriana Stan
Published: (2022-10-01)

Global–Local Self-Attention Based Transformer for Speaker Verification
by: Fei Xie, et al.
Published: (2022-10-01)

Modeling Long-Term Multimodal Representations for Active Speaker Detection With Spatio-Positional Encoder
by: Minyoung Kyoung, et al.
Published: (2023-01-01)

Speaker-turn aware diarization for speech-based cognitive assessments
by: Sean Shensheng Xu, et al.
Published: (2024-01-01)

Analysis of transition cost and model parameters in speaker diarization for meetings
by: Beatriz Martínez-González, et al.
Published: (2021-02-01)

Speaker Recognition Based on Semantic Indexing
by: Mohammed Sahib Altaei, et al.
Published: (2011-03-01)

Using Neural Network with Speaker Applications
by: Baghdad Science Journal
Published: (2010-06-01)

Forensic Automatic Speaker Recognition Based on Likelihood Ratio Using Acoustic-phonetic Features Measured Automatically
by: Huapeng Wang, et al.
Published: (2015-01-01)

Local Control of Audio Environment: A Review of Methods and Applications
by: Jussi Kuutti, et al.
Published: (2014-02-01)

Speaker Diarization and Identification From Single Channel Classroom Audio Recordings Using Virtual Microphones
by: Antonio Gomez, et al.
Published: (2022-01-01)

Research of speaker recognition system based on GMM-SVM
by: ZHAO Lihui, et al.
Published: (2014-05-01)

Inaudible Attack on AI Speakers
by: Seyitmammet Saparmammedovich Alchekov, et al.
Published: (2023-04-01)

A contrastive analysis of epistemic modality in scientific English
by: María Luisa Carrió Pastor
Published: (2015-03-01)

Multilingual Audio-Visual Smartphone Dataset and Evaluation
by: Hareesh Mandalapu, et al.
Published: (2021-01-01)

A Survey on Text-Dependent and Text-Independent Speaker Verification
by: Youzhi Tu, et al.
Published: (2022-01-01)

Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?
by: Héctor Delgado, et al.
Published: (2015-06-01)

Weighted Cluster-Range Loss and Criticality-Enhancement Loss for Speaker Recognition
by: Jianye Mo, et al.
Published: (2020-12-01)

Becoming IELTS Examiners: Demystifying Native-Speakerism in the Area of English Language Testing
by: Pritz Hutabarat
Published: (2022-10-01)

Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications
by: Olja Krčadinac, et al.
Published: (2021-09-01)

An Analysis of the Short Utterance Problem for Speaker Characterization
by: Ignacio Viñals, et al.
Published: (2019-09-01)

An Experimental Comparison of Modeling Techniques and Combination of Speaker – Specific Information from Different Languages for Multilingual Speaker Identification
by: Jayanna H.S., et al.
Published: (2016-10-01)

Improving Speaker Recognition by Biometric Voice Deconstruction
by: Luis Miguel eMazaira-Fernández, et al.
Published: (2015-09-01)

ASVtorch toolkit: Speaker verification with deep neural networks
by: Kong Aik Lee, et al.
Published: (2021-06-01)

Our speaker this evening : practical etiquatte manual for masters of ceremonies, committee chairmen and pastors /
by: 223722 Markley, Kenneth A.
Published: (1974)

COMPARATIVE ANALYSIS OF NEURAL NETWORK MODELS FOR THE PROBLEM OF SPEAKER RECOGNITION
by: Vladyslav Kholiev, et al.
Published: (2023-08-01)

A Simple Unsupervised Knowledge-Free Domain Adaptation for Speaker Recognition
by: Wan Lin, et al.
Published: (2024-01-01)

Laplacian Operator as Speaker Identification Parameter
by: S. K. Jamil
Published: (2009-12-01)

Unsupervised Learning of Total Variability Embedding for Speaker Verification with Random Digit Strings
by: Woo Hyun Kang, et al.
Published: (2019-04-01)

Automatic Speaker Recognition Dependency on Both the Shape of Auditory Critical Bands and Speaker Discriminative MFCCs
by: JOKIC, I., et al.
Published: (2015-11-01)

On the native/nonnative speaker notion and World Englishes: Debating with K. Rajagopalan
by: John Robert SCHMITZ

Multi-Accent Speaker Detection Using Normalize Feature MFCC Neural Network Method
by: Kristiawan Nugroho, et al.
Published: (2023-08-01)

Restricted Boltzmann Machine Vectors for Speaker Clustering and Tracking Tasks in TV Broadcast Shows
by: Umair Khan, et al.
Published: (2019-07-01)

Speaking with a KN95 face mask: a within-subjects study on speaker adaptation and strategies to improve intelligibility
by: Sarah E. Gutz, et al.
Published: (2022-07-01)