Region Collaborative Network for Detection-Based Vision-Language Understanding

Given a query language, a Detection-based Vision-Language Understanding (DVLU) system needs to respond based on the detected regions (i.e.,bounding boxes). With the significant advancement in object detection, DVLU has witnessed great improvements in recent years, such as Visual Question Answering (...

Full description

Bibliographic Details
Main Authors: Linyan Li, Kaile Du, Minming Gu, Fuyuan Hu, Fan Lyu
Format: Article
Language:English
Published: MDPI AG 2022-08-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/10/17/3110