Deep ATC speaker recognition based on voiceprint aggregation

For the problem of ATC speaker recognition, a method based on voiceprint feature aggregation is proposed, which could distinguish different speakers from an audio stream. First, we develop the ResNet spectrogram feature extractor and the NetVLAD feature fusion module, both of which seldom used in sp...

Full description

Bibliographic Details
Main Author: LI Yin-xuan, TANG Wen-yi, YANG Tao, WANG Xue-chuan, LI Cheng-xiang
Format: Article
Language:zho
Published: Editorial Office of Command Control and Simulation 2023-04-01
Series:Zhihui kongzhi yu fangzhen
Subjects:
Online Access:https://www.zhkzyfz.cn/fileup/1673-3819/PDF/1673-3819(2023)02-0112-04.pdf