A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
With the emergence of new digital technologies, a significant surge has been seen in the volume of multimedia data generated from various smart devices. Several challenges for data analysis have emerged to extract useful information from multimedia data. One such challenge is the early and accurate...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2022-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9755147/ |
_version_ | 1811300393878028288 |
---|---|
author | Ahmed Abbasi Abdul Rehman Rehman Javed Amanullah Yasin Zunera Jalil Natalia Kryvinska Usman Tariq |
author_facet | Ahmed Abbasi Abdul Rehman Rehman Javed Amanullah Yasin Zunera Jalil Natalia Kryvinska Usman Tariq |
author_sort | Ahmed Abbasi |
collection | DOAJ |
description | With the emergence of new digital technologies, a significant surge has been seen in the volume of multimedia data generated from various smart devices. Several challenges for data analysis have emerged to extract useful information from multimedia data. One such challenge is the early and accurate detection of anomalies in multimedia data. This study proposes an efficient technique for anomaly detection and classification of rare events in audio data. In this paper, we develop a vast audio dataset containing seven different rare events (anomalies) with 15 different background environmental settings (e.g., beach, restaurant, and train) to focus on both detection of anomalous audio and classification of rare sound (e.g., events—baby cry, gunshots, broken glasses, footsteps) events for audio forensics. The proposed approach uses the supreme feature extraction technique by extracting mel-frequency cepstral coefficients (MFCCs) features from the audio signals of the newly created dataset and selects the minimum number of best-performing features for optimum performance using principal component analysis (PCA). These features are input to state-of-the-art machine learning algorithms for performance analysis. We also apply machine learning algorithms to the state-of-the-art dataset and realize good results. Experimental results reveal that the proposed approach effectively detects all anomalies and superior performance to existing approaches in all environments and cases. |
first_indexed | 2024-04-13T06:51:35Z |
format | Article |
id | doaj.art-9fb5b2fa93d74ade80e7a6de9e652d8c |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-04-13T06:51:35Z |
publishDate | 2022-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-9fb5b2fa93d74ade80e7a6de9e652d8c2022-12-22T02:57:24ZengIEEEIEEE Access2169-35362022-01-0110388853889410.1109/ACCESS.2022.31666029755147A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio ForensicsAhmed Abbasi0Abdul Rehman Rehman Javed1https://orcid.org/0000-0002-0570-1813Amanullah Yasin2Zunera Jalil3https://orcid.org/0000-0003-2531-2564Natalia Kryvinska4https://orcid.org/0000-0003-3678-9229Usman Tariq5https://orcid.org/0000-0001-7672-1187Faculty of Computing and AI, Air University, Islamabad, PakistanDepartment of Cyber Security, Air University, Islamabad, PakistanDepartment of Creative Technologies, Air University, Islamabad, PakistanDepartment of Cyber Security, Air University, Islamabad, PakistanInformation Systems Department, Faculty of Management, Comenius University in Bratislava, Bratislava, SlovakiaCollege of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi ArabiaWith the emergence of new digital technologies, a significant surge has been seen in the volume of multimedia data generated from various smart devices. Several challenges for data analysis have emerged to extract useful information from multimedia data. One such challenge is the early and accurate detection of anomalies in multimedia data. This study proposes an efficient technique for anomaly detection and classification of rare events in audio data. In this paper, we develop a vast audio dataset containing seven different rare events (anomalies) with 15 different background environmental settings (e.g., beach, restaurant, and train) to focus on both detection of anomalous audio and classification of rare sound (e.g., events—baby cry, gunshots, broken glasses, footsteps) events for audio forensics. The proposed approach uses the supreme feature extraction technique by extracting mel-frequency cepstral coefficients (MFCCs) features from the audio signals of the newly created dataset and selects the minimum number of best-performing features for optimum performance using principal component analysis (PCA). These features are input to state-of-the-art machine learning algorithms for performance analysis. We also apply machine learning algorithms to the state-of-the-art dataset and realize good results. Experimental results reveal that the proposed approach effectively detects all anomalies and superior performance to existing approaches in all environments and cases.https://ieeexplore.ieee.org/document/9755147/Audio forensicsaudio analysisanomaly detectionkey feature extractionfeature selectionmachine learning |
spellingShingle | Ahmed Abbasi Abdul Rehman Rehman Javed Amanullah Yasin Zunera Jalil Natalia Kryvinska Usman Tariq A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics IEEE Access Audio forensics audio analysis anomaly detection key feature extraction feature selection machine learning |
title | A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics |
title_full | A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics |
title_fullStr | A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics |
title_full_unstemmed | A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics |
title_short | A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics |
title_sort | large scale benchmark dataset for anomaly detection and rare event classification for audio forensics |
topic | Audio forensics audio analysis anomaly detection key feature extraction feature selection machine learning |
url | https://ieeexplore.ieee.org/document/9755147/ |
work_keys_str_mv | AT ahmedabbasi alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT abdulrehmanrehmanjaved alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT amanullahyasin alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT zunerajalil alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT nataliakryvinska alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT usmantariq alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT ahmedabbasi largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT abdulrehmanrehmanjaved largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT amanullahyasin largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT zunerajalil largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT nataliakryvinska largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics AT usmantariq largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics |