A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics

With the emergence of new digital technologies, a significant surge has been seen in the volume of multimedia data generated from various smart devices. Several challenges for data analysis have emerged to extract useful information from multimedia data. One such challenge is the early and accurate...

Full description

Bibliographic Details
Main Authors:	Ahmed Abbasi, Abdul Rehman Rehman Javed, Amanullah Yasin, Zunera Jalil, Natalia Kryvinska, Usman Tariq
Format:	Article
Language:	English
Published:	IEEE 2022-01-01
Series:	IEEE Access
Subjects:	Audio forensics audio analysis anomaly detection key feature extraction feature selection machine learning
Online Access:	https://ieeexplore.ieee.org/document/9755147/

_version_	1811300393878028288
author	Ahmed Abbasi Abdul Rehman Rehman Javed Amanullah Yasin Zunera Jalil Natalia Kryvinska Usman Tariq
author_facet	Ahmed Abbasi Abdul Rehman Rehman Javed Amanullah Yasin Zunera Jalil Natalia Kryvinska Usman Tariq
author_sort	Ahmed Abbasi
collection	DOAJ
description	With the emergence of new digital technologies, a significant surge has been seen in the volume of multimedia data generated from various smart devices. Several challenges for data analysis have emerged to extract useful information from multimedia data. One such challenge is the early and accurate detection of anomalies in multimedia data. This study proposes an efficient technique for anomaly detection and classification of rare events in audio data. In this paper, we develop a vast audio dataset containing seven different rare events (anomalies) with 15 different background environmental settings (e.g., beach, restaurant, and train) to focus on both detection of anomalous audio and classification of rare sound (e.g., events—baby cry, gunshots, broken glasses, footsteps) events for audio forensics. The proposed approach uses the supreme feature extraction technique by extracting mel-frequency cepstral coefficients (MFCCs) features from the audio signals of the newly created dataset and selects the minimum number of best-performing features for optimum performance using principal component analysis (PCA). These features are input to state-of-the-art machine learning algorithms for performance analysis. We also apply machine learning algorithms to the state-of-the-art dataset and realize good results. Experimental results reveal that the proposed approach effectively detects all anomalies and superior performance to existing approaches in all environments and cases.
first_indexed	2024-04-13T06:51:35Z
format	Article
id	doaj.art-9fb5b2fa93d74ade80e7a6de9e652d8c
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-04-13T06:51:35Z
publishDate	2022-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-9fb5b2fa93d74ade80e7a6de9e652d8c2022-12-22T02:57:24ZengIEEEIEEE Access2169-35362022-01-0110388853889410.1109/ACCESS.2022.31666029755147A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio ForensicsAhmed Abbasi0Abdul Rehman Rehman Javed1https://orcid.org/0000-0002-0570-1813Amanullah Yasin2Zunera Jalil3https://orcid.org/0000-0003-2531-2564Natalia Kryvinska4https://orcid.org/0000-0003-3678-9229Usman Tariq5https://orcid.org/0000-0001-7672-1187Faculty of Computing and AI, Air University, Islamabad, PakistanDepartment of Cyber Security, Air University, Islamabad, PakistanDepartment of Creative Technologies, Air University, Islamabad, PakistanDepartment of Cyber Security, Air University, Islamabad, PakistanInformation Systems Department, Faculty of Management, Comenius University in Bratislava, Bratislava, SlovakiaCollege of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi ArabiaWith the emergence of new digital technologies, a significant surge has been seen in the volume of multimedia data generated from various smart devices. Several challenges for data analysis have emerged to extract useful information from multimedia data. One such challenge is the early and accurate detection of anomalies in multimedia data. This study proposes an efficient technique for anomaly detection and classification of rare events in audio data. In this paper, we develop a vast audio dataset containing seven different rare events (anomalies) with 15 different background environmental settings (e.g., beach, restaurant, and train) to focus on both detection of anomalous audio and classification of rare sound (e.g., events—baby cry, gunshots, broken glasses, footsteps) events for audio forensics. The proposed approach uses the supreme feature extraction technique by extracting mel-frequency cepstral coefficients (MFCCs) features from the audio signals of the newly created dataset and selects the minimum number of best-performing features for optimum performance using principal component analysis (PCA). These features are input to state-of-the-art machine learning algorithms for performance analysis. We also apply machine learning algorithms to the state-of-the-art dataset and realize good results. Experimental results reveal that the proposed approach effectively detects all anomalies and superior performance to existing approaches in all environments and cases.https://ieeexplore.ieee.org/document/9755147/Audio forensicsaudio analysisanomaly detectionkey feature extractionfeature selectionmachine learning
spellingShingle	Ahmed Abbasi Abdul Rehman Rehman Javed Amanullah Yasin Zunera Jalil Natalia Kryvinska Usman Tariq A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics IEEE Access Audio forensics audio analysis anomaly detection key feature extraction feature selection machine learning
title	A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
title_full	A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
title_fullStr	A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
title_full_unstemmed	A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
title_short	A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
title_sort	large scale benchmark dataset for anomaly detection and rare event classification for audio forensics
topic	Audio forensics audio analysis anomaly detection key feature extraction feature selection machine learning
url	https://ieeexplore.ieee.org/document/9755147/
work_keys_str_mv	AT ahmedabbasi alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT abdulrehmanrehmanjaved alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT amanullahyasin alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT zunerajalil alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT nataliakryvinska alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT usmantariq alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT ahmedabbasi largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT abdulrehmanrehmanjaved largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT amanullahyasin largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT zunerajalil largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT nataliakryvinska largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT usmantariq largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics

A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics

Similar Items