Rethinking visual prompting for multimodal large language models with external knowledge

Rethinking visual prompting for multimodal large language models with external knowledge

In recent years, multimodal large language models (MLLMs) have made significant strides by training on vast high-quality image-text datasets, enabling them to generally understand images well. However, the inherent difficulty in explicitly conveying fine-grained or spatially dense information in tex...

Бүрэн тодорхойлолт

Номзүйн дэлгэрэнгүй
Үндсэн зохиолчид:	Lin, Y, Li, Y, Chen, D, Xu, W, Clark, R, Torr, P, Yuan, L
Формат:	Internet publication
Хэл сонгох:	English
Хэвлэсэн:	2024

Ижил төстэй зүйлс

Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering
-н: Zhongjian Hu, зэрэг
Хэвлэсэн: (2024-09-01)

Knowledge graph construction for heart failure using large language models with prompt engineering
-н: Tianhan Xu, зэрэг
Хэвлэсэн: (2024-07-01)

Prompt Optimization in Large Language Models
-н: Antonio Sabbatella, зэрэг
Хэвлэсэн: (2024-03-01)

CAT: enhancing multimodal large language model to answer questions in dynamic audio-visual scenarios
-н: Ye, Q, зэрэг
Хэвлэсэн: (2024)

Review of large vision models and visual prompt engineering
-н: Jiaqi Wang, зэрэг
Хэвлэсэн: (2023-11-01)

A unified prompt-based framework for few-shot multimodal language analysis
-н: Xiaohan Zhang, зэрэг
Хэвлэсэн: (2025-06-01)

Learning visual prompts for guiding the attention of vision transformers
-н: Rezaei, R, зэрэг
Хэвлэсэн: (2024)

REKP: Refined External Knowledge into Prompt-Tuning for Few-Shot Text Classification
-н: Yuzhuo Dang, зэрэг
Хэвлэсэн: (2023-11-01)

Improving language model predictions via prompts enriched with knowledge graphs
-н: Brate, R, зэрэг
Хэвлэсэн: (2023)

Aligning, autoencoding and prompting large language models for novel disease reporting
-н: Liu, F, зэрэг
Хэвлэсэн: (2025)

uCAP: an unsupervised prompting method for vision-language models
-н: Nguyen, AT, зэрэг
Хэвлэсэн: (2024)

Predictive Prompts with Joint Training of Large Language Models for Explainable Recommendation
-н: Ching-Sheng Lin, зэрэг
Хэвлэсэн: (2023-10-01)

Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
-н: Yunqiao Fei, зэрэг
Хэвлэсэн: (2025-01-01)

Balancing Privacy and Robustness in Prompt Learning for Large Language Models
-н: Chiyu Shi, зэрэг
Хэвлэсэн: (2024-10-01)

Response Generated by Large Language Models Depends on the Structure of the Prompt
-н: Pradosh Kumar Sarangi, зэрэг
Хэвлэсэн: (2024-07-01)

Prompt Engineering: Guiding the Way to Effective Large Language Models
-н: Mohammad Aljanabi, зэрэг
Хэвлэсэн: (2023-11-01)

An image is worth 1000 lies: adversarial transferability across prompts on vision-language models
-н: Luo, H, зэрэг
Хэвлэсэн: (2024)

A Brief Overview of Few-Shot Prompting in the Large Language Models
-н: Vladlen Kulikov, зэрэг
Хэвлэсэн: (2023-05-01)

Diagnostic reasoning prompts reveal the potential for large language model interpretability in medicine
-н: Thomas Savage, зэрэг
Хэвлэсэн: (2024-01-01)

The application of multimodal large language models in medicine
-н: Jianing Qiu, зэрэг
Хэвлэсэн: (2024-04-01)

Clinical prompt learning with frozen language models
-н: Taylor, N, зэрэг
Хэвлэсэн: (2023)

LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
-н: De La Torre, Fernanda, зэрэг
Хэвлэсэн: (2024)

Large language model enhanced with prompt-based vanilla distillation for sentence embeddings
-н: Wang, Minghao
Хэвлэсэн: (2024)

Large multimodal models for visual reasoning
-н: Duong, Ngoc Yen
Хэвлэсэн: (2024)

Intelligent extraction of reservoir dispatching information integrating large language model and structured prompts
-н: Yangrui Yang, зэрэг
Хэвлэсэн: (2024-06-01)

A Security Risk Taxonomy for Prompt-Based Interaction With Large Language Models
-н: Erik Derner, зэрэг
Хэвлэсэн: (2024-01-01)

DetToolChain: a new prompting paradigm to unleash detection ability of MLLM
-н: Wu, Y, зэрэг
Хэвлэсэн: (2024)

Research and application of defense mechanism for prompt injection attack of large language model in financial industry
-н: MOU Daen, зэрэг
Хэвлэсэн: (2024-10-01)

A medical multimodal large language model for future pandemics
-н: Liu, F, зэрэг
Хэвлэсэн: (2023)

On the legal implications of Large Language Model answers: A prompt engineering approach and a view beyond by exploiting Knowledge Graphs
-н: George Hannah, зэрэг
Хэвлэсэн: (2025-01-01)

Rethinking Language
-н: Gastor Mapunda, зэрэг
Хэвлэсэн: (2024-09-01)

Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation
-н: Cyril Chhun, зэрэг
Хэвлэсэн: (2024-09-01)

Harnessing multimodal large language models for traffic knowledge graph generation and decision-making
-н: Senyun Kuang, зэрэг
Хэвлэсэн: (2024-12-01)

PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models
-н: Morgan Thomas, зэрэг
Хэвлэсэн: (2024-07-01)

The influence of knowledge visualization on externalizing tacit knowledge
-н: Ahmad, Khairul Bariah, зэрэг
Хэвлэсэн: (2011)

Rethinking of Coase Theorem: Externalities and Uncertainty
-н: Evgeny A. Kuzmin, зэрэг
Хэвлэсэн: (2015-10-01)

Rethinking of Coase Theorem: Externalities and Uncertainty
-н: Evgeny A. Kuzmin, зэрэг
Хэвлэсэн: (2015-10-01)

Rethinking of Coase Theorem: Externalities and Uncertainty
-н: Evgeny A. Kuzmin, зэрэг
Хэвлэсэн: (2015-10-01)

Rethinking of Coase Theorem: Externalities and Uncertainty
-н: Evgeny A. Kuzmin, зэрэг
Хэвлэсэн: (2015-12-01)

TEACHING ENGLISH AS A FOREIGN LANGUAGE: RETHINKING THE MULTIMODALITY AND COMMUNICATION SKILLS IN THE 21st CENTURY
-н: Liudmyla Byrkun
Хэвлэсэн: (2023-12-01)