Embodied object hunt

This study investigates the use of multimodal encoders in the Embodied Object Hunt task. The approach is motivated by recent developments in joint multimodal encoders such as CLIP, which can extract features common to images and text. This ability is ideal for tasks combining...

Bibliographic Details
Main Author:	Kam, Rainer I-Wen
Other Authors:	Cham Tat Jen
Format:	Final Year Project (FYP)
Language:	English
Published:	Nanyang Technological University 2024
Subjects:	Computer and Information Science
Online Access:	https://hdl.handle.net/10356/175084

Similar Items