Speaker Recognition Based on the Joint Loss Function
The statistical pyramid dense time-delay neural network (SPD-TDNN) model makes it difficult to deal with the imbalance of training data, poses a high risk of overfitting, and has weak generalization ability. To solve these problems, we propose a method based on the joint loss function and improved s...
Main Authors: | Tengteng Feng, Houbin Fan, Fengpei Ge, Shuxin Cao, Chunyan Liang |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-08-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/12/16/3447 |
Similar Items
-
Deep ATC speaker recognition based on voiceprint aggregation
by: LI Yin-xuan, TANG Wen-yi, YANG Tao, WANG Xue-chuan, LI Cheng-xiang
Published: (2023-04-01) -
Two-Step Joint Optimization with Auxiliary Loss Function for Noise-Robust Speech Recognition
by: Geon Woo Lee, et al.
Published: (2022-07-01) -
DS-GAU: Dual-sequences gated attention unit architecture for text-independent speaker verification
by: Tsung-Han Tsai, et al.
Published: (2023-09-01) -
Weighted Cluster-Range Loss and Criticality-Enhancement Loss for Speaker Recognition
by: Jianye Mo, et al.
Published: (2020-12-01) -
Masked multi-center angular margin loss for language recognition
by: Minghang Ju, et al.
Published: (2022-07-01)