Utterance-level aggregation for speaker recognition in the wild

The objective of this paper is speaker recognition 'in the wild', where utterances may be of variable length and may also contain irrelevant signals. Crucial elements in the design of deep networks for this task are the type of trunk (frame-level) network and the method of temporal aggregation. W...

Bibliographic Details
Main Authors:	Xie, W, Nagrani, A, Chung, J, Zisserman, A
Format:	Conference item
Published:	IEEE 2019

Similar Items

Voxceleb: large-scale speaker verification in the wild
by: Nagrani, A, et al.
Published: (2019)

Spot the conversation: Speaker diarisation in the wild
by: Chung, JS, et al.
Published: (2020)

VoxCeleb2: Deep speaker recognition
by: Chung, J, et al.
Published: (2018)

The VoxCeleb speaker recognition challenge: a retrospective
by: Huh, J, et al.
Published: (2024)

VoxCeleb: a large-scale speaker identification dataset
by: Nagrani, A, et al.
Published: (2017)

Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
by: Albanie, S, et al.
Published: (2018)

Emotion recognition in speech using cross-modal transfer in the wild
by: Albanie, S, et al.
Published: (2018)

Count, crop and recognise: fine-grained recognition in the wild
by: Bain, M, et al.
Published: (2020)

Audio-visual synchronisation in the wild
by: Chen, H, et al.
Published: (2021)

Chimpanzee face recognition from videos in the wild using deep learning
by: Schofield, D, et al.
Published: (2019)

Self-supervised utterance order prediction for emotion recognition in conversations
by: Jiang, Dazhi, et al.
Published: (2024)

Playing a part: speaker verification at the movies
by: Brown, A, et al.
Published: (2021)

Forgiving as a performative utterance
by: Swinburne, R
Published: (2021)

Automatic utterance segmentation in spontaneous speech
by: Yoshida, Norimasa, 1979-
Published: (2006)

The anatomy of meaning: speech, gesture, and composite utterances
by: Enfield, N. J.
Published: (2009)

Lip reading in the wild
by: Chung, J, et al.
Published: (2017)

Forensic speaker recognition
by: Bonstre, Jean-Francois, et al.
Published: (2010)

Speaker recognition application
by: Li, Yao Dong
Published: (2013)

Speaker recognition system
by: Song, Liyan
Published: (2012)

Understanding utterances: The need for pragmatic competence
by: Ariffin, Kamisah
Published: (2004)

Out of time: automated lip sync in the wild
by: Chung, J, et al.
Published: (2017)

Acoustic measurements for speaker recognition
by: Wolf, Jared John
Published: (2005)

Mobile phone speaker recognition
by: Thang, Hui Ru
Published: (2013)

Speech intelligibility and speaker recognition
by: Hawley, Mones E.
Published: (1977)

It's about time: analog clock reading in the wild
by: Yang, C, et al.
Published: (2022)

Lip Reading Sentences in the Wild
by: Chung, J, et al.
Published: (2017)

Slow-fast auditory streams for audio recognition
by: Kazakos, E, et al.
Published: (2021)

Single channel speech separation with constrained utterance level permutation invariant training using grid LSTM
by: Xu, Chenglin, et al.
Published: (2020)

Two-Word Usage: Cognitive Development And The Beginnings Of Combinatorial Utterance
by: Zakaria, Noraizzah Haji
Published: (1990)

Azuichi, short utterance and responses in Japanese conversation on interview context
by: Shariff, Muhammad Haikal, et al.
Published: (2015)

Utterance verification in large vocabulary spoken language understanding system
by: Yao, Huan, 1976-
Published: (2009)

Processing of speech utterances for computer aided training of speaking skills
by: Zhao, Sixuan
Published: (2014)

Tonal coarticulation in Thai disyllabic utterances: a preliminary study
by: Gandour, Jack, et al.
Published: (2024)

Amodal ground truth and completion in the wild
by: Zhan, G, et al.
Published: (2024)

Design of a speaker recognition system
by: Zhao, Guangye
Published: (2019)

Speaker recognition using neural network
by: Zul Rasyied, Ab. Rasat
Published: (2012)

ASR dependent techniques for speaker recognition
by: Park, Alex S. (Alex Seungryong), 1979-
Published: (2014)

Speaker independent continuous speech recognition
by: Soon, Ing Yann
Published: (2012)

Mobile phone speaker recognition application
by: Wong, Joseph Pin Jie
Published: (2015)

Mobile phone speaker recognition application
by: Tan, Xavier Junjie
Published: (2014)