Deep ATC speaker recognition based on voiceprint aggregation
For the problem of ATC speaker recognition, a method based on voiceprint feature aggregation is proposed, which could distinguish different speakers from an audio stream. First, we develop the ResNet spectrogram feature extractor and the NetVLAD feature fusion module, both of which seldom used in sp...
Main Author: | LI Yin-xuan, TANG Wen-yi, YANG Tao, WANG Xue-chuan, LI Cheng-xiang |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Office of Command Control and Simulation
2023-04-01
|
Series: | Zhihui kongzhi yu fangzhen |
Subjects: | |
Online Access: | https://www.zhkzyfz.cn/fileup/1673-3819/PDF/1673-3819(2023)02-0112-04.pdf |
Similar Items
-
DS-GAU: Dual-sequences gated attention unit architecture for text-independent speaker verification
by: Tsung-Han Tsai, et al.
Published: (2023-09-01) -
Aggregating Local Descriptors for Epigraphs Recognition
by: Giuseppe Amato, et al.
Published: (2014-09-01) -
Deep Tree Net-Vector of Locally Aggregated Descriptor (VLAD) Model
by: Abduljawad A. Amory, et al.
Published: (2019-01-01) -
ATC/DDD - классификационная система в фармакоэпидемиологических исследованиях
Published: (2018-06-01) -
Structure, Intent & Conformance Monitoring in ATC
by: Reynolds, Tom G., et al.
Published: (2007)