Self-Supervised Sound Promotion Method of Sound Localization from Video

Compared to traditional unimodal methods, multimodal audio-visual correspondence learning has many advantages in the field of video understanding, but it also faces significant challenges. To fully utilize the feature information from both modalities, we need to ensure accurate alignment o...

Bibliographic Details
Main Authors: Yang Li, Xiaoli Zhao, Zhuoyao Zhang
Format: Article
Language: English
Published: MDPI AG 2023-08-01
Series: Electronics
Online Access: https://www.mdpi.com/2079-9292/12/17/3558