Towards robust interpretability with self-explaining neural networks

© 2018 Curran Associates Inc.All rights reserved. Most recent work on interpretability of complex machine learning models has focused on estimating a posteriori explanations for previously trained models around specific predictions. Self-explaining models where interpretability plays a key role alre...

Full description

Bibliographic Details
Main Authors: Jaakkola, Tommi, Alvarez Melis, David
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language:English
Published: 2021
Online Access:https://hdl.handle.net/1721.1/137669.3