Identity Vector Extraction by Perceptual Wavelet Packet Entropy and Convolutional Neural Network for Voice Authentication

Recently, the accuracy of voice authentication systems has increased significantly due to the successful application of the identity vector (i-vector) model. This paper proposes a new method for i-vector extraction. In the method, a perceptual wavelet packet transform (PWPT) is designed to convert sp...

Bibliographic Details
Main Authors: Lei Lei, Kun She
Format: Article
Language: English
Published: MDPI AG 2018-08-01
Series: Entropy
Subjects: i-vector, wavelet entropy, speaker authentication, CNN
Online Access: http://www.mdpi.com/1099-4300/20/8/600
_version_ 1798040205489340416
author Lei Lei
Kun She
author_facet Lei Lei
Kun She
author_sort Lei Lei
collection DOAJ
description Recently, the accuracy of voice authentication systems has increased significantly owing to the successful application of the identity vector (i-vector) model. This paper proposes a new method for i-vector extraction. In this method, a perceptual wavelet packet transform (PWPT) is designed to convert speech utterances into wavelet entropy feature vectors, and a convolutional neural network (CNN) is designed to estimate the frame posteriors of those feature vectors. The i-vector is then extracted from those frame posteriors. Experiments on the TIMIT and VoxCeleb speech corpora show that the proposed method extracts an appropriate i-vector that reduces the equal error rate (EER) and improves the accuracy of the voice authentication system in both clean and noisy environments.
first_indexed 2024-04-11T22:04:14Z
format Article
id doaj.art-7d105d9d0c3c44f69aa1c797b3a6ddb1
institution Directory Open Access Journal
issn 1099-4300
language English
last_indexed 2024-04-11T22:04:14Z
publishDate 2018-08-01
publisher MDPI AG
record_format Article
series Entropy
spelling doaj.art-7d105d9d0c3c44f69aa1c797b3a6ddb1 | 2022-12-22T04:00:47Z | eng | MDPI AG | Entropy | 1099-4300 | 2018-08-01 | 20 | 8 | 600 | 10.3390/e20080600 | e20080600 | Identity Vector Extraction by Perceptual Wavelet Packet Entropy and Convolutional Neural Network for Voice Authentication | Lei Lei 0 | Kun She 1 | School of Information and Software Engineering, University of Electrical and Science and Technology of China, Chengdu 610054, China | School of Information and Software Engineering, University of Electrical and Science and Technology of China, Chengdu 610054, China | Recently, the accuracy of voice authentication systems has increased significantly owing to the successful application of the identity vector (i-vector) model. This paper proposes a new method for i-vector extraction. In this method, a perceptual wavelet packet transform (PWPT) is designed to convert speech utterances into wavelet entropy feature vectors, and a convolutional neural network (CNN) is designed to estimate the frame posteriors of those feature vectors. The i-vector is then extracted from those frame posteriors. Experiments on the TIMIT and VoxCeleb speech corpora show that the proposed method extracts an appropriate i-vector that reduces the equal error rate (EER) and improves the accuracy of the voice authentication system in both clean and noisy environments. | http://www.mdpi.com/1099-4300/20/8/600 | i-vector | wavelet entropy | speaker authentication | CNN
spellingShingle Lei Lei
Kun She
Identity Vector Extraction by Perceptual Wavelet Packet Entropy and Convolutional Neural Network for Voice Authentication
Entropy
i-vector
wavelet entropy
speaker authentication
CNN
title Identity Vector Extraction by Perceptual Wavelet Packet Entropy and Convolutional Neural Network for Voice Authentication
title_full Identity Vector Extraction by Perceptual Wavelet Packet Entropy and Convolutional Neural Network for Voice Authentication
title_fullStr Identity Vector Extraction by Perceptual Wavelet Packet Entropy and Convolutional Neural Network for Voice Authentication
title_full_unstemmed Identity Vector Extraction by Perceptual Wavelet Packet Entropy and Convolutional Neural Network for Voice Authentication
title_short Identity Vector Extraction by Perceptual Wavelet Packet Entropy and Convolutional Neural Network for Voice Authentication
title_sort identity vector extraction by perceptual wavelet packet entropy and convolutional neural network for voice authentication
topic i-vector
wavelet entropy
speaker authentication
CNN
url http://www.mdpi.com/1099-4300/20/8/600
work_keys_str_mv AT leilei identityvectorextractionbyperceptualwaveletpacketentropyandconvolutionalneuralnetworkforvoiceauthentication
AT kunshe identityvectorextractionbyperceptualwaveletpacketentropyandconvolutionalneuralnetworkforvoiceauthentication