Towards robust interpretability with self-explaining neural networks

Towards robust interpretability with self-explaining neural networks

Show other versions (1)

© 2018 Curran Associates Inc.All rights reserved. Most recent work on interpretability of complex machine learning models has focused on estimating a posteriori explanations for previously trained models around specific predictions. Self-explaining models where interpretability plays a key role alre...

Full description

Bibliographic Details
Main Authors:	Jaakkola, Tommi, Alvarez Melis, David
Format:	Article
Language:	English
Published:	2021
Online Access:	https://hdl.handle.net/1721.1/137669

Similar Items

Towards robust interpretability with self-explaining neural networks
by: Jaakkola, Tommi, et al.
Published: (2021)

Towards optimal transport with global invariances
by: Alvarez Melis, David, et al.
Published: (2021)

Towards robust explainability of deep neural networks against attribution attacks
by: Wang, Fan
Published: (2024)

Word Embeddings as Metric Recovery in Semantic Spaces
by: Hashimoto, Tatsunori B, et al.
Published: (2021)

Word Embeddings as Metric Recovery in Semantic Spaces
by: Tatsunori B. Hashimoto, et al.
Published: (2021-03-01)

Principal differences analysis: Interpretable characterization of differences between distributions
by: Mueller, Jonas Weylin, et al.
Published: (2018)

Tight certificates of adversarial robustness for randomly smoothed classifiers
by: Lee, Guang-He, et al.
Published: (2021)

Rationalizing Neural Predictions
by: Lei, Tao, et al.
Published: (2020)

Towards robust neural networks: evaluation and construction
by: Lu, J
Published: (2021)

Towards interpretable & robust face recognition
by: Pattra, Surya Paryanta
Published: (2022)

Toward conversational interpretations of neural networks: data collection
by: Yeow, Ming Xuan
Published: (2024)

Towards deep neural networks robust to adversarial examples
by: Matyasko, Alexander
Published: (2020)

Towards robust machine learning with graph neural networks
by: Jaeckle, F
Published: (2022)

Explaining deep neural networks
by: Camburu, OM
Published: (2020)

Towards interpretable & robust occluded facial recognition
by: Rachita, Agrawal
Published: (2023)

Explaining meaning: towards a minimalist account of legal interpretation
by: Barradas de Freitas, R
Published: (2014)

XElemNet: towards explainable AI for deep neural networks in materials science
by: Kewei Wang, et al.
Published: (2024-10-01)

Computing Upper and Lower Bounds on Likelihoods in Intractable Networks
by: Jaakkola, Tommi S., et al.
Published: (2004)

Deriving neural architectures from sequence and graph kernels
by: Lei, Tao, et al.
Published: (2021)

6.867 Machine Learning, Fall 2002
by: Jaakkola, Tommi S. (Tommi Sakari)
Published: (2002)

Variational methods for inference and estimation in graphical models
by: Jaakkola, Tommi S. (Tommi Sakari)
Published: (2005)

Explainable Artificial Intelligence for Bayesian Neural Networks: Toward Trustworthy Predictions of Ocean Dynamics
by: Mariana C. A. Clare, et al.
Published: (2022-11-01)

Aspect-augmented Adversarial Networks for Domain Adaptation
by: Zhang, Yuan, et al.
Published: (2021)

Aspect-augmented Adversarial Networks for Domain Adaptation
by: Yuan Zhang, et al.
Published: (2021-03-01)

Explainable equivariant neural networks for particle physics: PELICAN
by: Alexander Bogatskiy, et al.
Published: (2024-03-01)

Explainable neural computation via stack neural module networks
by: Hu, Ronghang, et al.
Published: (2022)

A hybrid transformer and attention based recurrent neural network for robust and interpretable sentiment analysis of tweets
by: Md Abrar Jahin, et al.
Published: (2024-10-01)

Towards verifying robustness of neural networks against a family of semantic perturbations
by: Mohapatra, Jeet, et al.
Published: (2021)

Learning bayesian network structure using lp relaxations
by: Jaakkola, Tommi S., et al.
Published: (2011)

Towards Robust And Practical Neural Video-Conferencing
by: Sivaraman, Vibhaalakshmi
Published: (2024)

Tree block coordinate descent for map in graphical models
by: Sontag, David Alexander, et al.
Published: (2011)

Reliable and Faithful Generative Explainers for Graph Neural Networks
by: Yiqiao Li, et al.
Published: (2024-12-01)

Visual Analytics in Explaining Neural Networks with Neuron Clustering
by: Gulsum Alicioglu, et al.
Published: (2024-04-01)

Robust diagnosis and meta visualizations of plant diseases through deep neural architecture with explainable AI
by: Sasikaladevi Natarajan, et al.
Published: (2024-06-01)

Mean Field Theory for Sigmoid Belief Networks
by: Saul, Lawrence K., et al.
Published: (2004)

Visualizing interpretations of deep neural networks
by: Ta, Quynh Nga
Published: (2023)

Visualizing interpretations of deep neural networks
by: Tan, Ryan Kang Wei
Published: (2022)

Automated Mechanistic Interpretability for Neural Networks
by: Liao, Isaac C.
Published: (2024)

Adversarial robustness of Bayesian neural networks
by: Wicker, M
Published: (2021)

Analysis of robust neural networks for control
by: Newton, M
Published: (2023)