Variational autoencoder for prosody-based speaker recognition

This paper describes a novel end-to-end deep generative model-based speaker recognition system using prosodic features. The usefulness of variational autoencoders (VAE) in learning the speaker-specific prosody representations for the speaker recognition task is examined herein for the first time. Th...

Full description

Bibliographic Details
Main Authors: Starlet Ben Alex, Leena Mary
Format: Article
Language:English
Published: Electronics and Telecommunications Research Institute (ETRI) 2023-08-01
Series:ETRI Journal
Subjects:
Online Access:https://doi.org/10.4218/etrij.2021-0377