Utterance-level aggregation for speaker recognition in the wild

The objective of this paper is speaker recognition 'in the wild', where utterances may be of variable length and may also contain irrelevant signals. Crucial elements in the design of deep networks for this task are the type of trunk (frame-level) network and the method of temporal aggregation. W...

Bibliographic Details
Main Authors:	Xie, W; Nagrani, A; Chung, J; Zisserman, A
Format:	Conference item
Published:	IEEE 2019

Similar Items

Voxceleb: large-scale speaker verification in the wild
by: Nagrani, A, et al.
Published: (2019)

Spot the conversation: Speaker diarisation in the wild
by: Chung, JS, et al.
Published: (2020)

VoxCeleb2: Deep speaker recognition
by: Chung, J, et al.
Published: (2018)

The VoxCeleb speaker recognition challenge: a retrospective
by: Huh, J, et al.
Published: (2024)

VoxCeleb: a large-scale speaker identification dataset
by: Nagrani, A, et al.
Published: (2017)

Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
by: Albanie, S, et al.
Published: (2018)

Count, crop and recognise: fine-grained recognition in the wild
by: Bain, M, et al.
Published: (2020)

Audio-visual synchronisation in the wild
by: Chen, H, et al.
Published: (2021)

Self-supervised utterance order prediction for emotion recognition in conversations
by: Jiang, Dazhi, et al.
Published: (2024)

Chimpanzee face recognition from videos in the wild using deep learning
by: Schofield, D, et al.
Published: (2019)

Playing a part: speaker verification at the movies
by: Brown, A, et al.
Published: (2021)

Forgiving as a performative utterance
by: Swinburne, R
Published: (2021)

Automatic utterance segmentation in spontaneous speech
by: Yoshida, Norimasa, 1979-
Published: (2006)

Lip reading in the wild
by: Chung, J, et al.
Published: (2017)

Forensic speaker recognition
by: Bonastre, Jean-François, et al.
Published: (2010)

Speaker recognition application
by: Li, Yao Dong.
Published: (2013)

Speaker recognition system
by: Song, Liyan.
Published: (2012)

Understanding utterances: the need for pragmatic competence
by: Ariffin, Kamisah
Published: (2004)

Acoustic measurements for speaker recognition
by: Wolf, Jared John
Published: (2005)

Mobile phone speaker recognition
by: Thang, Hui Ru.
Published: (2013)

Speech intelligibility and speaker recognition
by: Hawley, Mones E.
Published: (1977)

Out of time: automated lip sync in the wild
by: Chung, J, et al.
Published: (2017)

Single channel speech separation with constrained utterance level permutation invariant training using grid LSTM
by: Xu, Chenglin, et al.
Published: (2020)

Two-word usage: cognitive development and the beginnings of combinatorial utterance
by: Zakaria, Noraizzah Haji
Published: (1990)

Aizuchi, short utterance and responses in Japanese conversation on interview context
by: Shariff, Muhammad Haikal, et al.
Published: (2015)

Utterance verification in large vocabulary spoken language understanding system
by: Yao, Huan, 1976-
Published: (2009)

Processing of speech utterances for computer aided training of speaking skills
by: Zhao, Sixuan
Published: (2014)

Tonal coarticulation in Thai disyllabic utterances: a preliminary study
by: Gandour, Jack, et al.
Published: (2024)

It's about time: analog clock reading in the wild
by: Yang, C, et al.
Published: (2022)

Slow-fast auditory streams for audio recognition
by: Kazakos, E, et al.
Published: (2021)

Lip Reading Sentences in the Wild
by: Chung, J, et al.
Published: (2017)

Design of a speaker recognition system
by: Zhao, Guangye
Published: (2019)

Speaker recognition using neural network
by: Zul Rasyied, Ab. Rasat
Published: (2012)

ASR dependent techniques for speaker recognition
by: Park, Alex S. (Alex Seungryong), 1979-
Published: (2014)

Speaker independent continuous speech recognition
by: Soon, Ing Yann.
Published: (2012)

Mobile phone speaker recognition application
by: Wong, Joseph Pin Jie
Published: (2015)

Mobile phone speaker recognition application
by: Tan, Xavier Junjie
Published: (2014)

Security and privacy in speaker recognition systems
by: Turner, H
Published: (2021)

Amodal ground truth and completion in the wild
by: Zhan, G, et al.
Published: (2024)