Deep ATC speaker recognition based on voiceprint aggregation

To address the problem of air traffic control (ATC) speaker recognition, a method based on voiceprint feature aggregation is proposed that can distinguish different speakers in an audio stream. First, we develop a ResNet spectrogram feature extractor and a NetVLAD feature fusion module, both of which are seldom used in sp...

Bibliographic Details
Main Authors:	LI Yin-xuan, TANG Wen-yi, YANG Tao, WANG Xue-chuan, LI Cheng-xiang
Format:	Article
Language:	zho
Published:	Editorial Office of Command Control and Simulation 2023-04-01
Series:	Zhihui kongzhi yu fangzhen
Subjects:	TDNN | feature aggregation | VLAD | ATC voice
Online Access:	https://www.zhkzyfz.cn/fileup/1673-3819/PDF/1673-3819(2023)02-0112-04.pdf

Similar Items

DS-GAU: Dual-sequences gated attention unit architecture for text-independent speaker verification
by: Tsung-Han Tsai, et al.
Published: (2023-09-01)

Aggregating Local Descriptors for Epigraphs Recognition
by: Giuseppe Amato, et al.
Published: (2014-09-01)

Deep Tree Net-Vector of Locally Aggregated Descriptor (VLAD) Model
by: Abduljawad A. Amory, et al.
Published: (2019-01-01)

ATC/DDD - классификационная система в фармакоэпидемиологических исследованиях [ATC/DDD: a classification system in pharmacoepidemiological studies]
Published: (2018-06-01)

Structure, Intent & Conformance Monitoring in ATC
by: Reynolds, Tom G., et al.
Published: (2007)

THE APPLICATION OF EGP MATERIALS TO ATC STUDENTS OF CASEA MAKASSAR
by: Agus Rahmat, et al.
Published: (2017-01-01)

OPTIMAL ALLOCATION OF TCSC DEVICES FOR THE ENHANCEMENT OF ATC IN DEREGULATED POWER SYSTEM USING FLOWER POLLINATION ALGORITHM
by: K. T. VENKATRAMAN, et al.
Published: (2018-09-01)

Voiceprint Recognition under Cross-Scenario Conditions Using Perceptual Wavelet Packet Entropy-Guided Efficient-Channel-Attention–Res2Net–Time-Delay-Neural-Network Model
by: Shuqi Wang, et al.
Published: (2023-10-01)

Vehicle Detection in Aerial Images Using a Fast Oriented Region Search and the Vector of Locally Aggregated Descriptors
by: Chongyang Liu, et al.
Published: (2019-07-01)

A Deep Diacritics-Based Recognition Model for Arabic Speech: Quranic Verses as Case Study
by: Sarah S. Alrumiah, et al.
Published: (2023-01-01)

Prevalence of the most frequent ATC groups and subgroups in polypharmacized patients of Emergency medical service Belgrade
by: Petrov-Kiurski Miloranka, et al.
Published: (2016-01-01)

Enhancing Speaker Recognition with CRET Model: a fusion of CONV2D, RESNET and ECAPA-TDNN
by: Pinyan Li, et al.
Published: (2025-02-01)

Measuring Medicine Use: Applying ATC/DDD Methodology to Real-World Data
by: Samantha Hollingworth, et al.
Published: (2021-03-01)

The application of FGD to support concept of National Policy on health and safety work procedures on ATC employees in Indonesia
by: Lalu Muhammad Saleh, et al.
Published: (2021-01-01)

Rainfall prediction using time-delay wavelet neural network (TDWNN) model for assessing agrometeorological risk
by: MRINMOY RAY, et al.
Published: (2023-02-01)

Design of an ATC Tool for Conflict Detection Based on Machine Learning Techniques
by: Javier Alberto Pérez-Castán, et al.
Published: (2022-01-01)

Target Speaker Extraction by Fusing Voiceprint Features
by: Shidan Cheng, et al.
Published: (2022-08-01)

Assessment of Prescribing Pattern of antibiotics by Dentists in Ardabil city Based on the International ATC/DDD System
by: Mehrnoosh Kaviani, et al.
Published: (2024-07-01)

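Analisis Dampak Pemasangan ATCS Terhadap Emisi Gas Buang (CO2) di Jl. Jend. Sudirman Kota Tangerang [Analysis of the Impact of ATCS Installation on Exhaust Gas Emissions (CO2) on Jl. Jend. Sudirman, Tangerang City]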
by: Wahyu Jatmiko
Published: (2013-06-01)

The Quality of Alkali Treated Cottonii (ATC) Made from Eucheuma cottonii Collected from Different Regions In Indonesia
by: Muhamad Darmawan, et al.
Published: (2013-12-01)

Exploring Aggregated wav2vec 2.0 Features and Dual-Stream TDNN for Efficient Spoken Dialect Identification
by: Ananya Angra, et al.
Published: (2025-01-01)

EFFECT OF CHOPPING STEP AND DRYING TECHNIQUE ON THE QUALITY OF ALKALI TREATED COTTONII (ATC)
by: Singgih Wibowo, et al.
Published: (2013-05-01)

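ارزیابی آسیب پذیری لرزه ای بیمارستان های شهر یاسوج از دیدگاه پدافند غیرعامل و روش(ATC) [Seismic vulnerability assessment of hospitals in the city of Yasuj from the perspective of passive defense and the ATC method]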
by: مهرداد خلقی فرد, et al.
Published: (2021-05-01)

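Uji Coba Proses Daur Ulang Limbah Cair ATC (Alkali Treated Cottonii) Dengan Teknik Koagulasi dan Filtrasi [Trial of a Recycling Process for ATC (Alkali Treated Cottonii) Liquid Waste Using Coagulation and Filtration Techniques]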
by: Bakti Berlyanto Sedayu, et al.
Published: (2007-12-01)

NEW APPROACH FOR ONLINE ARABIC MANUSCRIPT RECOGNITION BY DEEP BELIEF NETWORK
by: Benbakreti Samir, et al.
Published: (2018-10-01)

ANALISIS ATTITUDES TOWARD CHEMISTRY (ATC) PADA MATERI KIMIA DASAR DENGAN PEMBELAJARAN DALAM JARINGAN (DARING) [Analysis of Attitudes Toward Chemistry (ATC) in Basic Chemistry Material with Online Learning]
by: Nur Alawiyah, et al.
Published: (2021-05-01)

From Vlad Țepeş to Count Dracula. A Challenging Relation between History and Myth
by: Ovidiu Ivancu
Published: (2019-10-01)

Unifying Deep ConvNet and Semantic Edge Features for Loop Closure Detection
by: Jie Jin, et al.
Published: (2022-09-01)

THE ANALYSIS OF WORK OF THE ATC SECTORS OF MOSCOW ATM CENTER, BASED ON THE STATISTICAL DATA
by: N. I. Divak
Published: (2016-11-01)

The Protective Role of Troxerutin (Trox) in Counteracting Anaplastic Thyroid Carcinoma (ATC) Progression
by: Valentina Bova, et al.
Published: (2024-08-01)

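EVALUASI KINERJA STRUKTUR BANGUNAN MENGGUNAKAN PUSHOVER ANALYSIS DENGAN METODE ATC-40 DAN FEMA 356 [Performance Evaluation of Building Structures Using Pushover Analysis with the ATC-40 and FEMA 356 Methods]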
by: R. Hendarto Prasetyo R. Bambang Kusuma Prihadi, et al.
Published: (2020-01-01)

Antibacterial Activity of Siwak (Salvadora persica Linn.) against Streptococcus mutans (ATC31987) and Bacteroides melaninogenicus
by: Zaenab Zaenab, et al.
Published: (2010-10-01)

Gas-bearing prediction of deep reservoir based on DNN embeddings
by: Shuying Ma, et al.
Published: (2023-04-01)

Corrigendum: Gas-bearing prediction of deep reservoir based on DNN embeddings
by: Shuying Ma, et al.
Published: (2023-05-01)

Optimization of Processing Conditions of Alkali Treated Cottonii (ATC) from Sap-free Eucheuma Cottonii
by: Fateha Fateha, et al.
Published: (2019-08-01)

Joint Learning of NNeXtVLAD, CNN and Context Gating for Micro-Video Venue Classification
by: Wei Liu, et al.
Published: (2019-01-01)

A Study of Speech Recognition for Kazakh Based on Unsupervised Pre-Training
by: Weijing Meng, et al.
Published: (2023-01-01)

Speaker Verification Combining Total Variability Space and Time Delay Neural Network
by: QU Yuquan, LONG Hua, DUAN Ying, SHAO Yubin, DU Qingzhi
Published: (2021-07-01)

A novel smart photoelectric lock system: Speech transmitted by laser and speech to text
by: Cheng-Yan Guo, et al.
Published: (2023-03-01)

A Novel Method of Aircraft Detection under Complex Background Based on Circular Intensity Filter and Rotation Invariant Feature
by: Xin Chen, et al.
Published: (2022-01-01)