A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics

With the emergence of new digital technologies, a significant surge has been seen in the volume of multimedia data generated from various smart devices. Several challenges for data analysis have emerged to extract useful information from multimedia data. One such challenge is the early and accurate...

Full description

Bibliographic Details
Main Authors: Ahmed Abbasi, Abdul Rehman Rehman Javed, Amanullah Yasin, Zunera Jalil, Natalia Kryvinska, Usman Tariq
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9755147/
_version_ 1811300393878028288
author Ahmed Abbasi
Abdul Rehman Rehman Javed
Amanullah Yasin
Zunera Jalil
Natalia Kryvinska
Usman Tariq
author_facet Ahmed Abbasi
Abdul Rehman Rehman Javed
Amanullah Yasin
Zunera Jalil
Natalia Kryvinska
Usman Tariq
author_sort Ahmed Abbasi
collection DOAJ
description With the emergence of new digital technologies, a significant surge has been seen in the volume of multimedia data generated from various smart devices. Several challenges for data analysis have emerged to extract useful information from multimedia data. One such challenge is the early and accurate detection of anomalies in multimedia data. This study proposes an efficient technique for anomaly detection and classification of rare events in audio data. In this paper, we develop a vast audio dataset containing seven different rare events (anomalies) with 15 different background environmental settings (e.g., beach, restaurant, and train) to focus on both detection of anomalous audio and classification of rare sound (e.g., events—baby cry, gunshots, broken glasses, footsteps) events for audio forensics. The proposed approach uses the supreme feature extraction technique by extracting mel-frequency cepstral coefficients (MFCCs) features from the audio signals of the newly created dataset and selects the minimum number of best-performing features for optimum performance using principal component analysis (PCA). These features are input to state-of-the-art machine learning algorithms for performance analysis. We also apply machine learning algorithms to the state-of-the-art dataset and realize good results. Experimental results reveal that the proposed approach effectively detects all anomalies and superior performance to existing approaches in all environments and cases.
first_indexed 2024-04-13T06:51:35Z
format Article
id doaj.art-9fb5b2fa93d74ade80e7a6de9e652d8c
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-13T06:51:35Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-9fb5b2fa93d74ade80e7a6de9e652d8c2022-12-22T02:57:24ZengIEEEIEEE Access2169-35362022-01-0110388853889410.1109/ACCESS.2022.31666029755147A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio ForensicsAhmed Abbasi0Abdul Rehman Rehman Javed1https://orcid.org/0000-0002-0570-1813Amanullah Yasin2Zunera Jalil3https://orcid.org/0000-0003-2531-2564Natalia Kryvinska4https://orcid.org/0000-0003-3678-9229Usman Tariq5https://orcid.org/0000-0001-7672-1187Faculty of Computing and AI, Air University, Islamabad, PakistanDepartment of Cyber Security, Air University, Islamabad, PakistanDepartment of Creative Technologies, Air University, Islamabad, PakistanDepartment of Cyber Security, Air University, Islamabad, PakistanInformation Systems Department, Faculty of Management, Comenius University in Bratislava, Bratislava, SlovakiaCollege of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi ArabiaWith the emergence of new digital technologies, a significant surge has been seen in the volume of multimedia data generated from various smart devices. Several challenges for data analysis have emerged to extract useful information from multimedia data. One such challenge is the early and accurate detection of anomalies in multimedia data. This study proposes an efficient technique for anomaly detection and classification of rare events in audio data. In this paper, we develop a vast audio dataset containing seven different rare events (anomalies) with 15 different background environmental settings (e.g., beach, restaurant, and train) to focus on both detection of anomalous audio and classification of rare sound (e.g., events—baby cry, gunshots, broken glasses, footsteps) events for audio forensics. The proposed approach uses the supreme feature extraction technique by extracting mel-frequency cepstral coefficients (MFCCs) features from the audio signals of the newly created dataset and selects the minimum number of best-performing features for optimum performance using principal component analysis (PCA). These features are input to state-of-the-art machine learning algorithms for performance analysis. We also apply machine learning algorithms to the state-of-the-art dataset and realize good results. Experimental results reveal that the proposed approach effectively detects all anomalies and superior performance to existing approaches in all environments and cases.https://ieeexplore.ieee.org/document/9755147/Audio forensicsaudio analysisanomaly detectionkey feature extractionfeature selectionmachine learning
spellingShingle Ahmed Abbasi
Abdul Rehman Rehman Javed
Amanullah Yasin
Zunera Jalil
Natalia Kryvinska
Usman Tariq
A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
IEEE Access
Audio forensics
audio analysis
anomaly detection
key feature extraction
feature selection
machine learning
title A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
title_full A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
title_fullStr A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
title_full_unstemmed A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
title_short A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
title_sort large scale benchmark dataset for anomaly detection and rare event classification for audio forensics
topic Audio forensics
audio analysis
anomaly detection
key feature extraction
feature selection
machine learning
url https://ieeexplore.ieee.org/document/9755147/
work_keys_str_mv AT ahmedabbasi alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics
AT abdulrehmanrehmanjaved alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics
AT amanullahyasin alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics
AT zunerajalil alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics
AT nataliakryvinska alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics
AT usmantariq alargescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics
AT ahmedabbasi largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics
AT abdulrehmanrehmanjaved largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics
AT amanullahyasin largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics
AT zunerajalil largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics
AT nataliakryvinska largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics
AT usmantariq largescalebenchmarkdatasetforanomalydetectionandrareeventclassificationforaudioforensics