Epic-sounds: a large-scale dataset of actions that sound

We introduce EPIC-SOUNDS, a large-scale dataset of audio annotations capturing temporal extents and class labels within the audio stream of the egocentric videos from EPIC-KITCHENS-100. We propose an annotation pipeline where annotators temporally label distinguishable audio segments and describe th...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Κύριοι συγγραφείς: Huh, J, Chalk, J, Kazakos, E, Damen, D, Zisserman, A
Μορφή: Conference item
Γλώσσα:English
Έκδοση: IEEE 2023