Deep Learning for Audio Event Detection and Tagging on Low-Resource Datasets

Deep Learning for Audio Event Detection and Tagging on Low-Resource Datasets

In training a deep learning system to perform audio transcription, two practical problems may arise. Firstly, most datasets are weakly labelled, having only a list of events present in each recording without any temporal information for training. Secondly, deep neural networks need a very large amou...

Full description

Bibliographic Details
Main Authors:	Veronica Morfi, Dan Stowell
Format:	Article
Language:	English
Published:	MDPI AG 2018-08-01
Series:	Applied Sciences
Subjects:	deep learning multi-task learning audio event detection audio tagging weak learning low-resource data
Online Access:	http://www.mdpi.com/2076-3417/8/8/1397

Similar Items

Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices
by: Rajat Hebbar, et al.
Published: (2021-02-01)

Deep Convolutional Neural Network with Structured Prediction for Weakly Supervised Audio Event Detection
by: Inkyu Choi, et al.
Published: (2019-06-01)

Classification of Overlapped Audio Events Based on AT, PLSA, and the Combination of Them
by: Y. Leng, et al.
Published: (2015-06-01)

A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
by: Ahmed Abbasi, et al.
Published: (2022-01-01)

VI-PANN: Harnessing Transfer Learning and Uncertainty-Aware Variational Inference for Improved Generalization in Audio Pattern Recognition
by: John Fischer, et al.
Published: (2024-01-01)

NIPS4Bplus: a richly annotated birdsong audio dataset
by: Veronica Morfi, et al.
Published: (2019-10-01)

AudioPairBank: towards a large-scale tag-pair-based audio content analysis
by: Sebastian Säger, et al.
Published: (2018-09-01)

Automated Audio Captioning With Topic Modeling
by: Aysegul Ozkaya Eren, et al.
Published: (2023-01-01)

Multi-rate modulation encoding via unsupervised learning for audio event detection
by: Sandeep Reddy Kothinti, et al.
Published: (2024-04-01)

Audio intelligent monitoring at the edge (AIME) for polyphonic sound sources
by: Lim, Victor
Published: (2024)

Automated Event Detection and Classification in Soccer: The Potential of Using Multiple Modalities
by: Olav Andre Nergård Rongved, et al.
Published: (2021-12-01)

Auditory Suspicious Event Databases: DASE and Bi-DASE
by: Buket D. Barkana, et al.
Published: (2018-01-01)

Reviews on Technology and Standard of Spatial Audio Coding
by: Ikhwana Elfitri, et al.
Published: (2017-03-01)

A Review of Modern Audio Deepfake Detection Methods: Challenges and Future Directions
by: Zaynab Almutairi, et al.
Published: (2022-05-01)

Deepfake Audio Detection via MFCC Features Using Machine Learning
by: Ameer Hamza, et al.
Published: (2022-01-01)

WaveBYOL: Self-Supervised Learning for Audio Representation From Raw Waveforms
by: Sunghyun Kim, et al.
Published: (2023-01-01)

MMATERIC: Multi-Task Learning and Multi-Fusion for AudioText Emotion Recognition in Conversation
by: Xingwei Liang, et al.
Published: (2023-03-01)

Toward Audio Beehive Monitoring: Deep Learning vs. Standard Machine Learning in Classifying Beehive Audio Samples
by: Vladimir Kulyukin, et al.
Published: (2018-09-01)

On Practical Issues of Electric Network Frequency Based Audio Forensics
by: Guang Hua, et al.
Published: (2017-01-01)

Audio-visual deep learning
by: Afouras, T, et al.
Published: (2021)

Violence Detection in Audio: Evaluating the Effectiveness of Deep Learning Models and Data Augmentation
by: Dalila Durães, et al.
Published: (2023-09-01)

Using audio-visual material to enhance laboratory practicals
by: Jennifer Schneider, et al.
Published: (2014-06-01)

Multi-encoder attention-based architectures for sound recognition with partial visual assistance
by: Wim Boes, et al.
Published: (2022-10-01)

Automatic Spatial Audio Scene Classification in Binaural Recordings of Music
by: Sławomir K. Zieliński, et al.
Published: (2019-04-01)

Using Audio Visual Media to Improve English Learning Outcomes
by: Sarah Maulida, et al.
Published: (2022-04-01)

Survey in Image and Audio Steganography by using the Deep Learning Methods
by: Zeina Al Hadad, et al.
Published: (2023-08-01)

Dataset of audio signals from brushless DC motors for predictive maintenance
by: Rommel Stiward Prieto Estacio, et al.
Published: (2023-10-01)

Ar-DAD: Arabic diversified audio dataset
by: Mohammed Lataifeh, et al.
Published: (2020-12-01)

Analisis Kemampuan Mpeg Spatial Audio Object Coding untuk Reproduksi Audio Multikanal
by: Amirul Luthfi, et al.
Published: (2017-07-01)

Blended Learning in Audio Description Training
by: Anna Jankowska
Published: (2017-12-01)

Penggunaan Media Pembelajaran Audio Visual dalam Mata Pelajaran PKn
by: Faizah Faizah
Published: (2017-09-01)

Spectrogram Dataset of Korean Smartphone Audio Files Forged Using the “Mix Paste” Command
by: Yeongmin Son, et al.
Published: (2023-12-01)

Audio-Based Aircraft Detection System for Safe RPAS BVLOS Operations
by: Jorge Mariscal-Harana, et al.
Published: (2020-12-01)

Design Dimensions of Co-Located Multi-Device Audio Experiences
by: David Geary, et al.
Published: (2022-07-01)

Audio resource in libraries of Ukraine: state and perspectives
by: Humenchuk Vasyl
Published: (2021-01-01)

Audio deepfakes: A survey
by: Zahra Khanjani, et al.
Published: (2023-01-01)

PENGARUH PENGGUNAAN MEDIA AUDIO DAN MOTIVASI BELAJAR SISWA TERHADAP HASIL BELAJAR BAHASA INDONESIA SISWA KELAS VSDN 001 RUMBAI KOTA PEKANBARU
by: Asmardi '
Published: (2016-12-01)

An Automatic Digital Audio Authentication/Forensics System
by: Zulfiqar Ali, et al.
Published: (2017-01-01)

Capcut-assisted Audio Visual Media to Improve Learning Outcomes of IPAS Students in Grade IV Elementary School
by: Nadirah Nadirah, et al.
Published: (2022-06-01)

Audio/ Videoconferencing Packages: High cost
by: Sonia Murillo, et al.
Published: (2006-02-01)