Assessing robustness of text classification through maximal safe radius computation

Neural network NLP models are vulnerable to small modifications of the input that maintain the original meaning but result in a different prediction. In this paper, we focus on robustness of text classification against word substitutions, aiming to provide guarantees that the model prediction does n...

Bibliographic Details
Main Authors: La Malfa, E; Wu, M; Laurenti, L; Wang, B; Hartshorn, A; Kwiatkowska, M
Format: Conference item
Language: English
Published: Association for Computational Linguistics, 2020