Interpretability Is in the Mind of the Beholder: A Causal Framework for Human-Interpretable Representation Learning

Research on Explainable Artificial Intelligence has recently started exploring the idea of producing explanations that, rather than being expressed in terms of low-level features, are encoded in terms of interpretable concepts learned from data. How to reliably acquire such concep...

Bibliographic Details
Main Authors:	Emanuele Marconato, Andrea Passerini, Stefano Teso
Format:	Article
Language:	English
Published:	MDPI AG 2023-11-01
Series:	Entropy
Subjects:	explainable AI; causal representation learning; alignment; disentanglement; causal abstractions; concept leakage
Online Access:	https://www.mdpi.com/1099-4300/25/12/1574

Similar Items