BAAD: A multipurpose dataset for automatic Bangla offensive speech recognition

In spite of being the fifth most spoken native language in the world, Bangla has barely received any attention in the domain of audio and speech recognition. This article represents a speech dataset of Bengali Abusive Words with some non-abusive wors which are very close to the abusive ones. In this...

Full description

Bibliographic Details
Main Authors: Md. Fahad Hossain, Md. Al Abid Supto, Zannat Chowdhury, Hana Sultan Chowdhury, Sheikh Abujar
Format: Article
Language:English
Published: Elsevier 2023-06-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340923001853