Detecting synthetic speech using long term magnitude and phase information

Synthetic speech is speech signals generated by text-to-speech (TTS) and voice conversion (VC) techniques. They impose a threat to speaker verification (SV) systems as an attacker may make use of TTS or VC to synthesize a speakers voice to cheat the SV system. To address this challenge, we study the...

Full description

Bibliographic Details
Main Authors: Tian, Xiaohai, Du, Steven, Xiao, Xiong, Xu, Haihua, Chng, Eng Siong, Li, Haizhou
Other Authors: School of Computer Science and Engineering
Format: Conference Paper
Language:English
Published: 2018
Subjects:
Online Access:https://hdl.handle.net/10356/89638
http://hdl.handle.net/10220/47055