MSGeN: Multimodal Selective Generation Network for Grounded Explanations

Modern models have shown impressive capabilities in visual reasoning tasks. However, the interpretability of their decision-making processes remains a challenge, causing uncertainty in their reliability. In response, we present the Multimodal Selective Generation Network (MSGeN), a novel approach to...

Bibliographic Details
Main Authors:	Dingbang Li, Wenzhou Chen, Xin Lin
Format:	Article
Language:	English
Published:	MDPI AG 2023-12-01
Series:	Electronics
Subjects:	visual question answering; explanation generation; multimodal; vision and language
Online Access:	https://www.mdpi.com/2079-9292/13/1/152

Similar Items

Multimodal Natural Language Explanation Generation for Visual Question Answering Based on Multiple Reference Data
by: He Zhu, et al.
Published: (2023-05-01)

Survey of Multimodal Medical Question Answering
by: Hilmi Demirhan, et al.
Published: (2023-12-01)

Interpreting vision and language generative models with semantic visual priors
by: Michele Cafagna, et al.
Published: (2023-09-01)

SBVQA 2.0: Robust End-to-End Speech-Based Visual Question Answering for Open-Ended Questions
by: Faris Alasmary, et al.
Published: (2023-01-01)

VL-Meta: Vision-Language Models for Multimodal Meta-Learning
by: Han Ma, et al.
Published: (2024-01-01)

VL-Few: Vision Language Alignment for Multimodal Few-Shot Meta Learning
by: Han Ma, et al.
Published: (2024-01-01)

TASTA: Text‐Assisted Spatial and Temporal Attention Network for Video Question Answering
by: Tian Wang, et al.
Published: (2023-04-01)

Standard refrigeration and air conditioning : questions and answers/
by: Elonka, Stephen Michael, et al.
Published: (1973)

Contrastive training of a multimodal encoder for medical visual question answering
by: João Daniel Silva, et al.
Published: (2023-05-01)

Attention-Based Multimodal Deep Learning on Vision-Language Data: Models, Datasets, Tasks, Evaluation Metrics and Applications
by: Priyankar Bose, et al.
Published: (2023-01-01)

Vision–Language Model for Visual Question Answering in Medical Imagery
by: Yakoub Bazi, et al.
Published: (2023-03-01)

Counterfactual Mix-Up for Visual Question Answering
by: Jae Won Cho, et al.
Published: (2023-01-01)

Multimodal representative answer extraction in community question answering
by: Ming Li, et al.
Published: (2023-10-01)

Pembangunan pangkalan data soal selidik berasaskan web /
by: Mohd. Najib Husein
Published: (2002)

Arabic Question Answering Systems: Gap Analysis
by: Mariam M. Biltawi, et al.
Published: (2021-01-01)

A Video Question Answering Model Based on Knowledge Distillation
by: Zhuang Shao, et al.
Published: (2023-06-01)

Semi-Supervised Implicit Augmentation for Data-Scarce VQA
by: Bhargav Dodla, et al.
Published: (2024-02-01)

Stumpers!: answers to hundreds of questions that stumped the experts /
by: Shapiro, Fred R.
Published: (1998)

Inilah soalan anda! : ganjil tapi benar. Jawapan kepada 99 soalan gila-gila /
by: Meikle, Marg, et al.
Published: (2008)

Kumpulan Soal-Jawab Dalam Post Graduate Course : Jurusan Ilmu Fiqh Dosen-Dosen I.A.I.N. /
by: Muhammad Hasbi Ash-Shiddieqy, author
Published: (1973)

A survey on complex factual question answering
by: Lingxi Zhang, et al.
Published: (2023-01-01)

Advancements in Complex Knowledge Graph Question Answering: A Survey
by: Yiqing Song, et al.
Published: (2023-10-01)

A Metamorphic Testing Approach for Assessing Question Answering Systems
by: Kaiyi Tu, et al.
Published: (2021-03-01)

Machine-to-Machine Visual Dialoguing with ChatGPT for Enriched Textual Image Description
by: Riccardo Ricci, et al.
Published: (2024-01-01)

Review of Visual Question Answering Technology
by: WANG Yu, SUN Haichun
Published: (2023-07-01)

Explanation of the concept of generation disjunction in studying generation z
by: Kamdin Parsakia, et al.
Published: (2023-04-01)

Goal-Driven Visual Question Generation from Radiology Images
by: Mourad Sarrouti, et al.
Published: (2021-08-01)

Answer Category-Aware Answer Selection for Question Answering
by: Weijing Wu, et al.
Published: (2021-01-01)

From heatmaps to structured explanations of image classifiers
by: Li Fuxin, et al.
Published: (2021-12-01)

Is an Explanation a Reason?
by: Paul Rastall
Published: (2022-12-01)

Visual question answering model for fruit tree disease decision-making based on multimodal deep learning
by: Yubin Lan, et al.
Published: (2023-01-01)

DOMAS: DATA ORIENTED MEDICAL VISUAL QUESTION ANSWERING USING SWIN TRANSFORMER
by: Teodora-Alexandra TOADER
Published: (2023-07-01)

Knowledge-Based Visual Question Answering Using Multi-Modal Semantic Graph
by: Lei Jiang, et al.
Published: (2023-03-01)

What Disease Does This Patient Have? A Large-Scale Open Domain Question Answering Dataset from Medical Exams
by: Di Jin, et al.
Published: (2021-07-01)

Explanation and interaction : the computer generation of explanatory dialogues /
by: Cawsey, Alison
Published: (1992)

1000 questions and answers /
by: Farndon, John, et al.
Published: (2014)

Essential facts : pockets /
by: Hetherington, Tim, editor, et al.
Published: (1996)

Anda bertanya Rasulullah menjawab /
by: Al Jauziyah, Ibnu Qayyim, author, et al.
Published: (2012)

Rekabentuk template penjanaan soal selidik berkomputer /
by: Murni Mohd. Nor
Published: (1998)

1001 questions and answers about your car /
by: Schultz, Morton J.
Published: (1973)