Rethinking visual prompting for multimodal large language models with external knowledge

In recent years, multimodal large language models (MLLMs) have made significant strides by training on vast high-quality image-text datasets, enabling them to generally understand images well. However, the inherent difficulty in explicitly conveying fine-grained or spatially dense information in tex...

Bibliographic details
Main authors:	Lin, Y; Li, Y; Chen, D; Xu, W; Clark, R; Torr, P; Yuan, L
Format:	Internet publication
Language:	English
Published:	2024

Similar items

Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering
by: Zhongjian Hu, et al.
Published: (2024-09-01)

Knowledge graph construction for heart failure using large language models with prompt engineering
by: Tianhan Xu, et al.
Published: (2024-07-01)

Prompt Optimization in Large Language Models
by: Antonio Sabbatella, et al.
Published: (2024-03-01)

CAT: enhancing multimodal large language model to answer questions in dynamic audio-visual scenarios
by: Ye, Q, et al.
Published: (2024)

Review of large vision models and visual prompt engineering
by: Jiaqi Wang, et al.
Published: (2023-11-01)

A unified prompt-based framework for few-shot multimodal language analysis
by: Xiaohan Zhang, et al.
Published: (2025-06-01)

Learning visual prompts for guiding the attention of vision transformers
by: Rezaei, R, et al.
Published: (2024)

REKP: Refined External Knowledge into Prompt-Tuning for Few-Shot Text Classification
by: Yuzhuo Dang, et al.
Published: (2023-11-01)

Improving language model predictions via prompts enriched with knowledge graphs
by: Brate, R, et al.
Published: (2023)

Aligning, autoencoding and prompting large language models for novel disease reporting
by: Liu, F, et al.
Published: (2025)

uCAP: an unsupervised prompting method for vision-language models
by: Nguyen, AT, et al.
Published: (2024)

Predictive Prompts with Joint Training of Large Language Models for Explainable Recommendation
by: Ching-Sheng Lin, et al.
Published: (2023-10-01)

Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
by: Yunqiao Fei, et al.
Published: (2025-01-01)

Balancing Privacy and Robustness in Prompt Learning for Large Language Models
by: Chiyu Shi, et al.
Published: (2024-10-01)

Response Generated by Large Language Models Depends on the Structure of the Prompt
by: Pradosh Kumar Sarangi, et al.
Published: (2024-07-01)

Prompt Engineering: Guiding the Way to Effective Large Language Models
by: Mohammad Aljanabi, et al.
Published: (2023-11-01)

An image is worth 1000 lies: adversarial transferability across prompts on vision-language models
by: Luo, H, et al.
Published: (2024)

A Brief Overview of Few-Shot Prompting in the Large Language Models
by: Vladlen Kulikov, et al.
Published: (2023-05-01)

Diagnostic reasoning prompts reveal the potential for large language model interpretability in medicine
by: Thomas Savage, et al.
Published: (2024-01-01)

The application of multimodal large language models in medicine
by: Jianing Qiu, et al.
Published: (2024-04-01)

Clinical prompt learning with frozen language models
by: Taylor, N, et al.
Published: (2023)

LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
by: De La Torre, Fernanda, et al.
Published: (2024)

Large language model enhanced with prompt-based vanilla distillation for sentence embeddings
by: Wang, Minghao
Published: (2024)

Large multimodal models for visual reasoning
by: Duong, Ngoc Yen
Published: (2024)

Intelligent extraction of reservoir dispatching information integrating large language model and structured prompts
by: Yangrui Yang, et al.
Published: (2024-06-01)

A Security Risk Taxonomy for Prompt-Based Interaction With Large Language Models
by: Erik Derner, et al.
Published: (2024-01-01)

DetToolChain: a new prompting paradigm to unleash detection ability of MLLM
by: Wu, Y, et al.
Published: (2024)

Research and application of defense mechanism for prompt injection attack of large language model in financial industry
by: MOU Daen, et al.
Published: (2024-10-01)

A medical multimodal large language model for future pandemics
by: Liu, F, et al.
Published: (2023)

On the legal implications of Large Language Model answers: A prompt engineering approach and a view beyond by exploiting Knowledge Graphs
by: George Hannah, et al.
Published: (2025-01-01)

Rethinking Language
by: Gastor Mapunda, et al.
Published: (2024-09-01)

Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation
by: Cyril Chhun, et al.
Published: (2024-09-01)

Harnessing multimodal large language models for traffic knowledge graph generation and decision-making
by: Senyun Kuang, et al.
Published: (2024-12-01)

PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models
by: Morgan Thomas, et al.
Published: (2024-07-01)

The influence of knowledge visualization on externalizing tacit knowledge
by: Ahmad, Khairul Bariah, et al.
Published: (2011)

Rethinking of Coase Theorem: Externalities and Uncertainty
by: Evgeny A. Kuzmin, et al.
Published: (2015-10-01)

Rethinking of Coase Theorem: Externalities and Uncertainty
by: Evgeny A. Kuzmin, et al.
Published: (2015-12-01)

TEACHING ENGLISH AS A FOREIGN LANGUAGE: RETHINKING THE MULTIMODALITY AND COMMUNICATION SKILLS IN THE 21st CENTURY
by: Liudmyla Byrkun
Published: (2023-12-01)