Energy-latency manipulation of multi-modal large language models via verbose samples

Despite the exceptional performance of multi-modal large language models (MLLMs), their deployment requires substantial computational resources. If malicious users induce high energy consumption and latency (energy-latency cost), this will exhaust computational resources and harm availability o...

Bibliographic Details
Main Authors: Gao, K, Gu, J, Bai, Y, Xia, S-T, Torr, P, Liu, W, Li, Z
Format: Conference item
Language: English
Published: 2024