Blueprint Separable Subsampling and Aggregate Feature Conformer-Based End-to-End Neural Diarization
At present, a prevalent approach to speaker diarization is clustering based on speaker embeddings. However, this method encounters two primary issues. Firstly, it cannot directly minimize the diarization error during the training process; secondly, the majority of clustering-based methods struggle t...
Main Authors: | Xiaolin Jiao, Yaqi Chen, Dan Qu, Xukui Yang |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-10-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/12/19/4118 |
Similar Items
-
Self-Conditioning via Intermediate Predictions for End-to-End Neural Speaker Diarization
by: Yusuke Fujita, et al.
Published: (2023-01-01) -
Unsupervised adaptation of PLDA models for broadcast diarization
by: Ignacio Viñals, et al.
Published: (2019-12-01) -
Speaker Diarization through Waveform and Neural Net
by: Rustam Latypov, et al.
Published: (2021-05-01) -
Speech Enhancement for Multimodal Speaker Diarization System
by: Rehan Ahmad, et al.
Published: (2020-01-01) -
Speaker Diarization Based on Speech Signal Approximation by Step Function
by: Rustam Latypov, et al.
Published: (2021-01-01)