Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation

Distribution shift is a major source of failure for machine learning models. However, evaluating model reliability under distribution shift can be challenging, especially since it may be difficult to acquire counterfactual examples that exhibit a specified shift. In this work, we introduce the notio...

Bibliographic Details
Main Author: Vendrow, Joshua L.
Other Authors: Mądry, Aleksander
Format: Thesis
Published: Massachusetts Institute of Technology 2024
Online Access: https://hdl.handle.net/1721.1/156303