Energy-latency manipulation of multi-modal large language models via verbose samples
Despite the exceptional performance of multi-modal large language models (MLLMs), their deployment requires substantial computational resources. If malicious users induce high energy consumption and latency (energy-latency cost), they can exhaust computational resources and harm availability o...
Main Authors: | , , , , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: | 2024 |