Input features for deep learning-based polyphonic sound event localization and detection
Sound event localization and detection (SELD) is an emerging research topic that combines the tasks of sound event detection (SED) and direction-of-arrival estimation (DOAE). The SELD task aims to jointly recognize the sound classes and estimate the directions of arrival (DOAs) and the temporal acti...
Main Author: | Nguyen, Thi Ngoc Tho |
---|---|
Other Authors: | Gan Woon Seng |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/168245 |
Similar Items
-
Recognizing sound-event by machine learning
by: Athaariq Ramadino
Published: (2020) -
SALSA: spatial cue-augmented log-spectrogram features for polyphonic sound event localization and detection
by: Nguyen, Thi Ngoc Tho, et al.
Published: (2022) -
Deep learning for anomaly detection in computational imaging
by: Du, Xinglin
Published: (2022) -
Deep neural networks based visual object detection in videos
by: Jin, Ruibing
Published: (2020) -
Feature extraction and dimensionality reduction in pattern recognition with applications in speech recognition
by: Jiang, Hai
Published: (2008)