Grounding referring expression in computer vision
This project studies the integration of language and vision in computer vision, focusing on Grounding Referring Expressions utilising the state-of-the-art GroundingDINO model. We address the topic of object identification and segmentation, emphasising zero-shot models’ ability to recognise items...
Main Author: | Yuen, Shaun Chien Wee |
---|---|
Other Authors: | Hanwang Zhang |
Format: | Final Year Project (FYP) |
Language: | English |
Published: | Nanyang Technological University, 2024 |
Subjects: | |
Online Access: | https://hdl.handle.net/10356/174979 |
Similar Items
- Vision, brain and cooperative computation
  by: Arbib, Michael A., et al.
  Published: (1987)
- Human and Machine Vision
  by: Seeland, Anett, contributor, et al.
  Published: (2015)
- PrefAce: face-centric pretraining with self-structure aware distillation
  by: Hu, Siyuan
  Published: (2024)
- Feature binding and robust vision in machines and primates
  by: Leadholm, N
  Published: (2022)
- When Computer Vision Gazes at Cognition
  by: Gao, Tao, et al.
  Published: (2015)