Deep Learning for Audio Event Detection and Tagging on Low-Resource Datasets
In training a deep learning system to perform audio transcription, two practical problems may arise. Firstly, most datasets are weakly labelled, having only a list of events present in each recording without any temporal information for training. Secondly, deep neural networks need a very large amou...
Main Authors: | Veronica Morfi, Dan Stowell |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2018-08-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | http://www.mdpi.com/2076-3417/8/8/1397 |
Similar Items
-
Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices
by: Rajat Hebbar, et al.
Published: (2021-02-01) -
Deep Convolutional Neural Network with Structured Prediction for Weakly Supervised Audio Event Detection
by: Inkyu Choi, et al.
Published: (2019-06-01) -
Classification of Overlapped Audio Events Based on AT, PLSA, and the Combination of Them
by: Y. Leng, et al.
Published: (2015-06-01) -
A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
by: Ahmed Abbasi, et al.
Published: (2022-01-01) -
VI-PANN: Harnessing Transfer Learning and Uncertainty-Aware Variational Inference for Improved Generalization in Audio Pattern Recognition
by: John Fischer, et al.
Published: (2024-01-01)