Towards unbiased visual language reasoning and consistent segmentation
In recent years, we have made significant advances in standard recognition tasks such as classification, detection or segmentation. To further understand from vi- sion, more and more researchers pay attention to introduce text information for reasoning. Such as image caption, visual question answeri...
Main Author: | Huang, Jianqiang |
---|---|
Other Authors: | Hanwang Zhang |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/169540 |
Similar Items
Similar Items
-
Exploiting visual context and consistency for semantic segmentation
by: Kang, Dang
Published: (2018) -
Towards unbiased visual emotion recognition via causal intervention
by: Chen, Yuedong, et al.
Published: (2023) -
Instance LSeg - exploring instance level information from visual language model
by: Lin, Zixing
Published: (2023) -
Reasoning over multiple human-human interaction activities
by: Perez, Mauricio Lisboa
Published: (2021) -
Language-guided visual retrieval
by: He, Su
Published: (2021)