Stop reasoning! When multimodal LLMs with chain-of-thought reasoning meets adversarial images

Multimodal LLMs (MLLMs) have recently shown a strong ability to understand images. However, like traditional vision models, they remain vulnerable to adversarial images. Meanwhile, Chain-of-Thought (CoT) reasoning has been widely explored for MLLMs; it not only improves the model’s performance, but...

Bibliographic details
Main authors:	Wang, Z, Han, Z, Chen, S, Xue, F, Ding, Z, Xiao, X, Tresp, V, Torr, P, Gu, J
Format:	Conference item
Language:	English
Published:	IEEE 2024

