Sparse in space and time: audio-visual synchronisation with trainable selectors
The objective of this paper is audio-visual synchronisation of general videos ‘in the wild’. For such videos, the events that may be harnessed for synchronisation cues may be spatially small and may occur only infrequently during a many seconds-long video clip, i.e. the...
Main authors: | , , , |
---|---|
Material type: | Conference item |
Language: | English |
Published: | British Machine Vision Association, 2022 |