A Data-Based Perspective on Model Reliability

Neural networks can fail to generalize to real world data — particularly on subpopulations that might have been mislabelled, corrupted, or underrepresented during training. In such settings, the set of features that a model relies on, or its feature prior, often determines the model’s ultimate relia...

Full description

Bibliographic Details
Main Author:	Jain, Saachi
Other Authors:	Mądry, Aleksander
Format:	Thesis
Published:	Massachusetts Institute of Technology 2024
Online Access:	https://hdl.handle.net/1721.1/153886

_version_	1811078101049802752
author	Jain, Saachi
author2	Mądry, Aleksander
author_facet	Mądry, Aleksander Jain, Saachi
author_sort	Jain, Saachi
collection	MIT
description	Neural networks can fail to generalize to real world data — particularly on subpopulations that might have been mislabelled, corrupted, or underrepresented during training. In such settings, the set of features that a model relies on, or its feature prior, often determines the model’s ultimate reliability. While many factors contribute to a model’s feature prior, recent evidence indicates that the training dataset often plays a pivotal role. This thesis therefore aims to build the foundation for a data-centric perspective on model reliability, by uncovering how the training dataset’s composition affects the model’s feature prior, and thus the mistakes the model tends to make. It advances this objective through two main thrusts: developing scalable tools for identifying model failure modes in large datasets in large datasets and investigating the impact of pre-training data on the reliability of transfer learning models. In the first thrust, we develop techniques for uncovering meaningful patterns of model errors, especially in settings where manual exploration is prohibitively expensive. This includes building a framework for generating counterfactual images to debug model behavior as well as introducing a technique for automatically identifying failure modes by distilling them as directions in a latent space. We also propose a data-based approach to mitigate such failures at their source, by isolating training examples that drive a targeted bias. to mitigate such failures at their source, by isolating training examples that drive a targeted bias. In the second thrust, we investigate the role of the pre-training data in the transfer learning setting, where a pre-trained model is adapted to a downstream task. Here, we f irst explore the problem of “bias transfer”, where biases from the pre-trained model can persist even after adapting the model to the downstream task. We then introduce transfer influences, a framework for pinpointing the counterfactual impact of a pre-training datapoint on the final prediction. This framework enables us to isolate (and remove) detrimental points from the pre-training dataset to improve transfer learning performance.
first_indexed	2024-09-23T10:53:24Z
format	Thesis
id	mit-1721.1/153886
institution	Massachusetts Institute of Technology
last_indexed	2024-09-23T10:53:24Z
publishDate	2024
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/1538862024-03-22T03:49:35Z A Data-Based Perspective on Model Reliability Jain, Saachi Mądry, Aleksander Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Neural networks can fail to generalize to real world data — particularly on subpopulations that might have been mislabelled, corrupted, or underrepresented during training. In such settings, the set of features that a model relies on, or its feature prior, often determines the model’s ultimate reliability. While many factors contribute to a model’s feature prior, recent evidence indicates that the training dataset often plays a pivotal role. This thesis therefore aims to build the foundation for a data-centric perspective on model reliability, by uncovering how the training dataset’s composition affects the model’s feature prior, and thus the mistakes the model tends to make. It advances this objective through two main thrusts: developing scalable tools for identifying model failure modes in large datasets in large datasets and investigating the impact of pre-training data on the reliability of transfer learning models. In the first thrust, we develop techniques for uncovering meaningful patterns of model errors, especially in settings where manual exploration is prohibitively expensive. This includes building a framework for generating counterfactual images to debug model behavior as well as introducing a technique for automatically identifying failure modes by distilling them as directions in a latent space. We also propose a data-based approach to mitigate such failures at their source, by isolating training examples that drive a targeted bias. to mitigate such failures at their source, by isolating training examples that drive a targeted bias. In the second thrust, we investigate the role of the pre-training data in the transfer learning setting, where a pre-trained model is adapted to a downstream task. Here, we f irst explore the problem of “bias transfer”, where biases from the pre-trained model can persist even after adapting the model to the downstream task. We then introduce transfer influences, a framework for pinpointing the counterfactual impact of a pre-training datapoint on the final prediction. This framework enables us to isolate (and remove) detrimental points from the pre-training dataset to improve transfer learning performance. Ph.D. 2024-03-21T19:13:36Z 2024-03-21T19:13:36Z 2024-02 2024-02-21T17:18:47.875Z Thesis https://hdl.handle.net/1721.1/153886 In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle	Jain, Saachi A Data-Based Perspective on Model Reliability
title	A Data-Based Perspective on Model Reliability
title_full	A Data-Based Perspective on Model Reliability
title_fullStr	A Data-Based Perspective on Model Reliability
title_full_unstemmed	A Data-Based Perspective on Model Reliability
title_short	A Data-Based Perspective on Model Reliability
title_sort	data based perspective on model reliability
url	https://hdl.handle.net/1721.1/153886
work_keys_str_mv	AT jainsaachi adatabasedperspectiveonmodelreliability AT jainsaachi databasedperspectiveonmodelreliability

A Data-Based Perspective on Model Reliability

Similar Items