Sparse in space and time: audio-visual synchronisation with trainable selectors
<p>The objective of this paper is audio-visual synchronisation of general videos ‘in the wild’. For such videos, the events that may be harnessed for synchronisation cues may be spatially small and may occur only infrequently during a many seconds-long video clip, i.e. the...
Main Authors: | , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
British Machine Vision Association
2022
|