Dynamic Debiasing Network for Visual Commonsense Generation

The task of Visual Commonsense Generation (VCG) delves into the deeper narrative behind a static image, aiming to comprehend not just its immediate content but also the surrounding context. The VCG model generates three types of captions for each image: 1) the events preceding the image, 2) the char...

Bibliographic Details
Main Authors:	Jungeun Kim, Jinwoo Park, Jaekwang Seok, Junyeong Kim
Format:	Article
Language:	English
Published:	IEEE 2023-01-01
Series:	IEEE Access
Subjects:	Multimodal reasoning; visual commonsense generation; VisualCOMET; dataset bias; debiasing; causal inference
Online Access:	https://ieeexplore.ieee.org/document/10348563/

Similar Items