Towards robust interpretability with self-explaining neural networks
© 2018 Curran Associates Inc. All rights reserved. Most recent work on interpretability of complex machine learning models has focused on estimating a posteriori explanations for previously trained models around specific predictions. Self-explaining models where interpretability plays a key role alre...
Main Authors: | Jaakkola, Tommi; Alvarez Melis, David |
---|---|
Format: | Article |
Language: | English |
Published: | 2021 |
Online Access: | https://hdl.handle.net/1721.1/137669 |
Similar Items
- Towards robust interpretability with self-explaining neural networks
  by: Jaakkola, Tommi, et al.
  Published: (2021)
- Towards optimal transport with global invariances
  by: Alvarez Melis, David, et al.
  Published: (2021)
- Towards robust explainability of deep neural networks against attribution attacks
  by: Wang, Fan
  Published: (2024)
- Word Embeddings as Metric Recovery in Semantic Spaces
  by: Hashimoto, Tatsunori B, et al.
  Published: (2021)
- Word Embeddings as Metric Recovery in Semantic Spaces
  by: Tatsunori B. Hashimoto, et al.
  Published: (2021-03-01)