Towards robust interpretability with self-explaining neural networks

© 2018 Curran Associates Inc.All rights reserved. Most recent work on interpretability of complex machine learning models has focused on estimating a posteriori explanations for previously trained models around specific predictions. Self-explaining models where interpretability plays a key role alre...

Full description

Bibliographic Details
Main Authors:	Jaakkola, Tommi, Alvarez Melis, David
Other Authors:	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format:	Article
Language:	English
Published:	2021
Online Access:	https://hdl.handle.net/1721.1/137669.3

Internet

https://hdl.handle.net/1721.1/137669.3

Towards robust interpretability with self-explaining neural networks

Internet

Similar Items