Verbs in action: improving verb understanding in video-language models
Understanding verbs is crucial to modelling how people and objects interact with each other and the environment through space and time. Recently, state-of-the-art video-language models based on CLIP have been shown to have limited verb understanding and to rely extensively on nouns, restricting thei...
Auteurs principaux: | Momeni, L, Caron, M, Nagrani, A, Zisserman, A, Schmid, C |
---|---|
Format: | Conference item |
Langue: | English |
Publié: |
IEEE
2024
|
Documents similaires
-
Auxiliary verbs
par: Fernández, LG, et autres
Publié: (2024) -
Verb natures /
par: Ferre, Albert, et autres
Publié: (2006) -
The English verb /
par: 206859 Palmer, F. R.
Publié: (1974) -
The English verb /
par: 206859 Palmer, F. R.
Publié: (1987) -
The structure of the Tamil verb
par: Sathasivam, A
Publié: (1956)