A dataset for voice-based human identity recognition

This paper introduces a new English speech dataset suitable for training and evaluating speaker recognition systems. Samples were obtained from non-native English speakers from the Arab region over the course of two months. The dataset was divided into two sub-datasets. Ten samples were collected fr...

Full description

Bibliographic Details
Main Authors:	Baha’ A. Alsaify, Hadeel S. Abu Arja, Baskal Y. Maayah, Masa M. Al-Taweel
Format:	Article
Language:	English
Published:	Elsevier 2022-06-01
Series:	Data in Brief
Subjects:	FLAC Same phrase Audio dataset Different phrase Voice recognition Applied machine learning
Online Access:	http://www.sciencedirect.com/science/article/pii/S2352340922002815

Description
Summary:	This paper introduces a new English speech dataset suitable for training and evaluating speaker recognition systems. Samples were obtained from non-native English speakers from the Arab region over the course of two months. The dataset was divided into two sub-datasets. Ten samples were collected from each speaker for each sub-dataset. The first sub-dataset contains samples of speakers repeating the phrase “Machine learning 1, 2, 3, 4, 5, 6, 7, 8, 9, 10”. The second sub-dataset contains samples for the same speakers speaking randomly for five to ten seconds for each sample. The dataset consists of 150 speakers with a total of 3,000 data samples and about six hours of speech.
ISSN:	2352-3409

A dataset for voice-based human identity recognition

Similar Items