Level-wise aligned dual networks for text–video retrieval
Abstract: The vast number of videos on the Internet makes efficient and accurate text–video retrieval increasingly important. Current methods align video and text in a high-dimensional space for this task. However, a high-dimensional space cannot fully use different levels of inf...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: | SpringerOpen 2022-07-01 |
Series: | EURASIP Journal on Advances in Signal Processing |
Subjects: | |
Online Access: | https://doi.org/10.1186/s13634-022-00887-y |