Відправити по sms: Inducing high energy-latency of large vision-language models with verbose images