Variational autoencoder for prosody-based speaker recognition

This paper describes a novel end-to-end deep generative model-based speaker recognition system using prosodic features. The usefulness of variational autoencoders (VAE) in learning the speaker-specific prosody representations for the speaker recognition task is examined herein for the first time. Th...

Full description

Bibliographic Details
Main Authors:	Starlet Ben Alex, Leena Mary
Format:	Article
Language:	English
Published:	Electronics and Telecommunications Research Institute (ETRI) 2023-08-01
Series:	ETRI Journal
Subjects:	deep neural networks prosodic features speaker recognition syllables vae
Online Access:	https://doi.org/10.4218/etrij.2021-0377

Internet

https://doi.org/10.4218/etrij.2021-0377

Variational autoencoder for prosody-based speaker recognition

Internet

Similar Items