Aligning Faithful Interpretations with their Social Attribution

AbstractWe find that the requirement of model interpretations to be faithful is vague and incomplete. With interpretation by textual highlights as a case study, we present several failure cases. Borrowing concepts from social science, we identify that the problem is a misalignment be...

Full description

Bibliographic Details
Main Authors: Alon Jacovi, Yoav Goldberg
Format: Article
Language:English
Published: The MIT Press 2021-01-01
Series:Transactions of the Association for Computational Linguistics
Online Access:https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00367/98620/Aligning-Faithful-Interpretations-with-their