Rethinking visual prompting for multimodal large language models with external knowledge

Rethinking visual prompting for multimodal large language models with external knowledge

In recent years, multimodal large language models (MLLMs) have made significant strides by training on vast high-quality image-text datasets, enabling them to generally understand images well. However, the inherent difficulty in explicitly conveying fine-grained or spatially dense information in tex...

Podrobná bibliografie
Hlavní autoři:	Lin, Y, Li, Y, Chen, D, Xu, W, Clark, R, Torr, P, Yuan, L
Médium:	Internet publication
Jazyk:	English
Vydáno:	2024

Podobné jednotky

Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering
Autor: Zhongjian Hu, a další
Vydáno: (2024-09-01)

Knowledge graph construction for heart failure using large language models with prompt engineering
Autor: Tianhan Xu, a další
Vydáno: (2024-07-01)

Prompt Optimization in Large Language Models
Autor: Antonio Sabbatella, a další
Vydáno: (2024-03-01)

CAT: enhancing multimodal large language model to answer questions in dynamic audio-visual scenarios
Autor: Ye, Q, a další
Vydáno: (2024)

Review of large vision models and visual prompt engineering
Autor: Jiaqi Wang, a další
Vydáno: (2023-11-01)

A unified prompt-based framework for few-shot multimodal language analysis
Autor: Xiaohan Zhang, a další
Vydáno: (2025-06-01)

Learning visual prompts for guiding the attention of vision transformers
Autor: Rezaei, R, a další
Vydáno: (2024)

REKP: Refined External Knowledge into Prompt-Tuning for Few-Shot Text Classification
Autor: Yuzhuo Dang, a další
Vydáno: (2023-11-01)

Improving language model predictions via prompts enriched with knowledge graphs
Autor: Brate, R, a další
Vydáno: (2023)

Aligning, autoencoding and prompting large language models for novel disease reporting
Autor: Liu, F, a další
Vydáno: (2025)

uCAP: an unsupervised prompting method for vision-language models
Autor: Nguyen, AT, a další
Vydáno: (2024)

Predictive Prompts with Joint Training of Large Language Models for Explainable Recommendation
Autor: Ching-Sheng Lin, a další
Vydáno: (2023-10-01)

Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
Autor: Yunqiao Fei, a další
Vydáno: (2025-01-01)

Balancing Privacy and Robustness in Prompt Learning for Large Language Models
Autor: Chiyu Shi, a další
Vydáno: (2024-10-01)

Response Generated by Large Language Models Depends on the Structure of the Prompt
Autor: Pradosh Kumar Sarangi, a další
Vydáno: (2024-07-01)

Prompt Engineering: Guiding the Way to Effective Large Language Models
Autor: Mohammad Aljanabi, a další
Vydáno: (2023-11-01)

An image is worth 1000 lies: adversarial transferability across prompts on vision-language models
Autor: Luo, H, a další
Vydáno: (2024)

A Brief Overview of Few-Shot Prompting in the Large Language Models
Autor: Vladlen Kulikov, a další
Vydáno: (2023-05-01)

Diagnostic reasoning prompts reveal the potential for large language model interpretability in medicine
Autor: Thomas Savage, a další
Vydáno: (2024-01-01)

The application of multimodal large language models in medicine
Autor: Jianing Qiu, a další
Vydáno: (2024-04-01)

Clinical prompt learning with frozen language models
Autor: Taylor, N, a další
Vydáno: (2023)

LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
Autor: De La Torre, Fernanda, a další
Vydáno: (2024)

Large language model enhanced with prompt-based vanilla distillation for sentence embeddings
Autor: Wang, Minghao
Vydáno: (2024)

Large multimodal models for visual reasoning
Autor: Duong, Ngoc Yen
Vydáno: (2024)

Intelligent extraction of reservoir dispatching information integrating large language model and structured prompts
Autor: Yangrui Yang, a další
Vydáno: (2024-06-01)

A Security Risk Taxonomy for Prompt-Based Interaction With Large Language Models
Autor: Erik Derner, a další
Vydáno: (2024-01-01)

DetToolChain: a new prompting paradigm to unleash detection ability of MLLM
Autor: Wu, Y, a další
Vydáno: (2024)

Research and application of defense mechanism for prompt injection attack of large language model in financial industry
Autor: MOU Daen, a další
Vydáno: (2024-10-01)

A medical multimodal large language model for future pandemics
Autor: Liu, F, a další
Vydáno: (2023)

On the legal implications of Large Language Model answers: A prompt engineering approach and a view beyond by exploiting Knowledge Graphs
Autor: George Hannah, a další
Vydáno: (2025-01-01)

Rethinking Language
Autor: Gastor Mapunda, a další
Vydáno: (2024-09-01)

Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation
Autor: Cyril Chhun, a další
Vydáno: (2024-09-01)

Harnessing multimodal large language models for traffic knowledge graph generation and decision-making
Autor: Senyun Kuang, a další
Vydáno: (2024-12-01)

PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models
Autor: Morgan Thomas, a další
Vydáno: (2024-07-01)

The influence of knowledge visualization on externalizing tacit knowledge
Autor: Ahmad, Khairul Bariah, a další
Vydáno: (2011)

Rethinking of Coase Theorem: Externalities and Uncertainty
Autor: Evgeny A. Kuzmin, a další
Vydáno: (2015-10-01)

Rethinking of Coase Theorem: Externalities and Uncertainty
Autor: Evgeny A. Kuzmin, a další
Vydáno: (2015-10-01)

Rethinking of Coase Theorem: Externalities and Uncertainty
Autor: Evgeny A. Kuzmin, a další
Vydáno: (2015-10-01)

Rethinking of Coase Theorem: Externalities and Uncertainty
Autor: Evgeny A. Kuzmin, a další
Vydáno: (2015-12-01)

TEACHING ENGLISH AS A FOREIGN LANGUAGE: RETHINKING THE MULTIMODALITY AND COMMUNICATION SKILLS IN THE 21st CENTURY
Autor: Liudmyla Byrkun
Vydáno: (2023-12-01)