Utterance-level aggregation for speaker recognition in the wild

The objective of this paper is speaker recognition 'in the wild', where utterances may be of variable length and also contain irrelevant signals. Crucial elements in the design of deep networks for this task are the type of trunk (frame level) network, and the method of temporal aggregation. W...

Bibliographic details
Main authors:	Xie, W, Nagrani, A, Chung, J, Zisserman, A
Format:	Conference item
Published:	IEEE 2019

Similar items

Voxceleb: large-scale speaker verification in the wild
by: Nagrani, A, et al.
Published: (2019)

Spot the conversation: Speaker diarisation in the wild
by: Chung, JS, et al.
Published: (2020)

VoxCeleb2: Deep speaker recognition
by: Chung, J, et al.
Published: (2018)

The VoxCeleb speaker recognition challenge: a retrospective
by: Huh, J, et al.
Published: (2024)

VoxCeleb: a large-scale speaker identification dataset
by: Nagrani, A, et al.
Published: (2017)

Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
by: Albanie, S, et al.
Published: (2018)

Emotion recognition in speech using cross-modal transfer in the wild
by: Albanie, S, et al.
Published: (2018)

Count, crop and recognise: fine-grained recognition in the wild
by: Bain, M, et al.
Published: (2020)

Audio-visual synchronisation in the wild
by: Chen, H, et al.
Published: (2021)

Chimpanzee face recognition from videos in the wild using deep learning
by: Schofield, D, et al.
Published: (2019)

Self-supervised utterance order prediction for emotion recognition in conversations
by: Jiang, Dazhi, et al.
Published: (2024)

Playing a part: speaker verification at the movies
by: Brown, A, et al.
Published: (2021)

Forgiving as a performative utterance
by: Swinburne, R
Published: (2021)

Automatic utterance segmentation in spontaneous speech
by: Yoshida, Norimasa, 1979-
Published: (2006)

The anatomy of meaning: speech, gesture, and composite utterances
by: Enfield, N. J.
Published: (2009)

Lip reading in the wild
by: Chung, J, et al.
Published: (2017)

Forensic speaker recognition
by: Bonstre, Jean-Francois, et al.
Published: (2010)

Speaker recognition application
by: Li, Yao Dong.
Published: (2013)

Speaker recognition system
by: Song, Liyan.
Published: (2012)

Understanding utterances: The need for pragmatic competence
by: Ariffin, Kamisah
Published: (2004)

Acoustic measurements for speaker recognition
by: Wolf, Jared John
Published: (2005)

Mobile phone speaker recognition
by: Thang, Hui Ru.
Published: (2013)

Speech intelligibility and speaker recognition
by: Hawley, Mones E.
Published: (1977)

Out of time: automated lip sync in the wild
by: Chung, J, et al.
Published: (2017)

It's about time: analog clock reading in the wild
by: Yang, C, et al.
Published: (2022)

Slow-fast auditory streams for audio recognition
by: Kazakos, E, et al.
Published: (2021)

Lip Reading Sentences in the Wild
by: Chung, J, et al.
Published: (2017)

Single channel speech separation with constrained utterance level permutation invariant training using grid LSTM
by: Xu, Chenglin, et al.
Published: (2020)

Two-Word Usage: Cognitive Development And The Beginnings Of Combinatorial Utterance
by: Zakaria, Noraizzah Haji
Published: (1990)

Azuichi, short utterance and responses in Japanese conversation on interview context
by: Shariff, Muhammad Haikal, et al.
Published: (2015)

Utterance verification in large vocabulary spoken language understanding system
by: Yao, Huan, 1976-
Published: (2009)

Processing of speech utterances for computer aided training of speaking skills
by: Zhao, Sixuan
Published: (2014)

Tonal coarticulation in Thai disyllabic utterances: a preliminary study
by: Gandour, Jack, et al.
Published: (2024)

Amodal ground truth and completion in the wild
by: Zhan, G, et al.
Published: (2024)

Design of a speaker recognition system
by: Zhao, Guangye
Published: (2019)

Speaker recognition using neural network
by: Zul Rasyied, Ab. Rasat
Published: (2012)

ASR dependent techniques for speaker recognition
by: Park, Alex S. (Alex Seungryong), 1979-
Published: (2014)

Speaker independent continuous speech recognition
by: Soon, Ing Yann.
Published: (2012)

Mobile phone speaker recognition application
by: Wong, Joseph Pin Jie
Published: (2015)

Mobile phone speaker recognition application
by: Tan, Xavier Junjie
Published: (2014)