ZSE-VITS: A Zero-Shot Expressive Voice Cloning Method Based on VITS

Voice cloning aims to synthesize the voice with a new speaker’s timbre from a small amount of the new speaker’s speech. Current voice cloning methods, which focus on modeling speaker timbre, can synthesize speech with similar speaker timbres. However, the prosody of these methods is flat, lacking ex...

Full description

Bibliographic Details
Main Authors: Jiaxin Li, Lianhai Zhang
Format: Article
Language:English
Published: MDPI AG 2023-02-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/12/4/820