Rethinking visual prompting for multimodal large language models with external knowledge

Rethinking visual prompting for multimodal large language models with external knowledge

In recent years, multimodal large language models (MLLMs) have made significant strides by training on vast high-quality image-text datasets, enabling them to generally understand images well. However, the inherent difficulty in explicitly conveying fine-grained or spatially dense information in tex...

書誌詳細
主要な著者:	Lin, Y, Li, Y, Chen, D, Xu, W, Clark, R, Torr, P, Yuan, L
フォーマット:	Internet publication
言語:	English
出版事項:	2024

類似資料

Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering
著者:: Zhongjian Hu, 等
出版事項: (2024-09-01)

Knowledge graph construction for heart failure using large language models with prompt engineering
著者:: Tianhan Xu, 等
出版事項: (2024-07-01)

Prompt Optimization in Large Language Models
著者:: Antonio Sabbatella, 等
出版事項: (2024-03-01)

CAT: enhancing multimodal large language model to answer questions in dynamic audio-visual scenarios
著者:: Ye, Q, 等
出版事項: (2024)

Review of large vision models and visual prompt engineering
著者:: Jiaqi Wang, 等
出版事項: (2023-11-01)

A unified prompt-based framework for few-shot multimodal language analysis
著者:: Xiaohan Zhang, 等
出版事項: (2025-06-01)

Learning visual prompts for guiding the attention of vision transformers
著者:: Rezaei, R, 等
出版事項: (2024)

REKP: Refined External Knowledge into Prompt-Tuning for Few-Shot Text Classification
著者:: Yuzhuo Dang, 等
出版事項: (2023-11-01)

Improving language model predictions via prompts enriched with knowledge graphs
著者:: Brate, R, 等
出版事項: (2023)

Aligning, autoencoding and prompting large language models for novel disease reporting
著者:: Liu, F, 等
出版事項: (2025)

uCAP: an unsupervised prompting method for vision-language models
著者:: Nguyen, AT, 等
出版事項: (2024)

Predictive Prompts with Joint Training of Large Language Models for Explainable Recommendation
著者:: Ching-Sheng Lin, 等
出版事項: (2023-10-01)

Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
著者:: Yunqiao Fei, 等
出版事項: (2025-01-01)

Balancing Privacy and Robustness in Prompt Learning for Large Language Models
著者:: Chiyu Shi, 等
出版事項: (2024-10-01)

Response Generated by Large Language Models Depends on the Structure of the Prompt
著者:: Pradosh Kumar Sarangi, 等
出版事項: (2024-07-01)

Prompt Engineering: Guiding the Way to Effective Large Language Models
著者:: Mohammad Aljanabi, 等
出版事項: (2023-11-01)

An image is worth 1000 lies: adversarial transferability across prompts on vision-language models
著者:: Luo, H, 等
出版事項: (2024)

A Brief Overview of Few-Shot Prompting in the Large Language Models
著者:: Vladlen Kulikov, 等
出版事項: (2023-05-01)

Diagnostic reasoning prompts reveal the potential for large language model interpretability in medicine
著者:: Thomas Savage, 等
出版事項: (2024-01-01)

The application of multimodal large language models in medicine
著者:: Jianing Qiu, 等
出版事項: (2024-04-01)

Clinical prompt learning with frozen language models
著者:: Taylor, N, 等
出版事項: (2023)

LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
著者:: De La Torre, Fernanda, 等
出版事項: (2024)

Large language model enhanced with prompt-based vanilla distillation for sentence embeddings
著者:: Wang, Minghao
出版事項: (2024)

Large multimodal models for visual reasoning
著者:: Duong, Ngoc Yen
出版事項: (2024)

Intelligent extraction of reservoir dispatching information integrating large language model and structured prompts
著者:: Yangrui Yang, 等
出版事項: (2024-06-01)

A Security Risk Taxonomy for Prompt-Based Interaction With Large Language Models
著者:: Erik Derner, 等
出版事項: (2024-01-01)

DetToolChain: a new prompting paradigm to unleash detection ability of MLLM
著者:: Wu, Y, 等
出版事項: (2024)

Research and application of defense mechanism for prompt injection attack of large language model in financial industry
著者:: MOU Daen, 等
出版事項: (2024-10-01)

A medical multimodal large language model for future pandemics
著者:: Liu, F, 等
出版事項: (2023)

On the legal implications of Large Language Model answers: A prompt engineering approach and a view beyond by exploiting Knowledge Graphs
著者:: George Hannah, 等
出版事項: (2025-01-01)

Rethinking Language
著者:: Gastor Mapunda, 等
出版事項: (2024-09-01)

Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation
著者:: Cyril Chhun, 等
出版事項: (2024-09-01)

Harnessing multimodal large language models for traffic knowledge graph generation and decision-making
著者:: Senyun Kuang, 等
出版事項: (2024-12-01)

PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models
著者:: Morgan Thomas, 等
出版事項: (2024-07-01)

The influence of knowledge visualization on externalizing tacit knowledge
著者:: Ahmad, Khairul Bariah, 等
出版事項: (2011)

Rethinking of Coase Theorem: Externalities and Uncertainty
著者:: Evgeny A. Kuzmin, 等
出版事項: (2015-10-01)

Rethinking of Coase Theorem: Externalities and Uncertainty
著者:: Evgeny A. Kuzmin, 等
出版事項: (2015-10-01)

Rethinking of Coase Theorem: Externalities and Uncertainty
著者:: Evgeny A. Kuzmin, 等
出版事項: (2015-10-01)

Rethinking of Coase Theorem: Externalities and Uncertainty
著者:: Evgeny A. Kuzmin, 等
出版事項: (2015-12-01)

TEACHING ENGLISH AS A FOREIGN LANGUAGE: RETHINKING THE MULTIMODALITY AND COMMUNICATION SKILLS IN THE 21st CENTURY
著者:: Liudmyla Byrkun
出版事項: (2023-12-01)