Self-Supervised Moving Vehicle Tracking With Stereo Sound

© 2019 IEEE. Humans are able to localize objects in the environment using both visual and auditory cues, integrating information from multiple modalities into a common reference frame. We introduce a system that can leverage unlabeled audiovisual data to learn to localize objects (moving vehicles) i...

Bibliographic Details
Main Authors: Gan, Chuang; Zhao, Hang; Chen, Peihao; Cox, David; Torralba, Antonio
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language: English
Published: IEEE 2021
Online Access: https://hdl.handle.net/1721.1/137172