Understanding and Estimating the Adaptability of Domain-Invariant Representations

When the test distribution differs from the training distribution, machine learning models can perform poorly and wrongly overestimate their performance. In this work, we aim to better estimate the model’s performance under distribution shift, without supervision. To do so, we use a set of domain-in...

Full description

Bibliographic Details
Main Author: Chuang, Ching-Yao
Other Authors: Jegelka, Stefanie
Format: Thesis
Published: Massachusetts Institute of Technology 2022
Online Access:https://hdl.handle.net/1721.1/139150
_version_ 1826190843186249728
author Chuang, Ching-Yao
author2 Jegelka, Stefanie
author_facet Jegelka, Stefanie
Chuang, Ching-Yao
author_sort Chuang, Ching-Yao
collection MIT
description When the test distribution differs from the training distribution, machine learning models can perform poorly and wrongly overestimate their performance. In this work, we aim to better estimate the model’s performance under distribution shift, without supervision. To do so, we use a set of domain-invariant predictors as a proxy for the unknown, true target labels, where the error of this estimation is bounded by the target risk of the proxy model. Therefore, we study the generalization of domain-invariant representations and show that the complexity of the latent representation has a significant influence on the target risk. Empirically, our estimation approach can self-tune to find the optimal model complexity and the resulting models achieve good target generalization, and estimate target error of other models well. Applications of our results include model selection, deciding early stopping, error detection, and predicting the adaptability of a model between domains.
first_indexed 2024-09-23T08:46:10Z
format Thesis
id mit-1721.1/139150
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T08:46:10Z
publishDate 2022
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1391502022-01-15T04:05:02Z Understanding and Estimating the Adaptability of Domain-Invariant Representations Chuang, Ching-Yao Jegelka, Stefanie Torralba, Antonio Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science When the test distribution differs from the training distribution, machine learning models can perform poorly and wrongly overestimate their performance. In this work, we aim to better estimate the model’s performance under distribution shift, without supervision. To do so, we use a set of domain-invariant predictors as a proxy for the unknown, true target labels, where the error of this estimation is bounded by the target risk of the proxy model. Therefore, we study the generalization of domain-invariant representations and show that the complexity of the latent representation has a significant influence on the target risk. Empirically, our estimation approach can self-tune to find the optimal model complexity and the resulting models achieve good target generalization, and estimate target error of other models well. Applications of our results include model selection, deciding early stopping, error detection, and predicting the adaptability of a model between domains. S.M. 2022-01-14T14:52:54Z 2022-01-14T14:52:54Z 2021-06 2021-06-24T19:20:15.437Z Thesis https://hdl.handle.net/1721.1/139150 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Chuang, Ching-Yao
Understanding and Estimating the Adaptability of Domain-Invariant Representations
title Understanding and Estimating the Adaptability of Domain-Invariant Representations
title_full Understanding and Estimating the Adaptability of Domain-Invariant Representations
title_fullStr Understanding and Estimating the Adaptability of Domain-Invariant Representations
title_full_unstemmed Understanding and Estimating the Adaptability of Domain-Invariant Representations
title_short Understanding and Estimating the Adaptability of Domain-Invariant Representations
title_sort understanding and estimating the adaptability of domain invariant representations
url https://hdl.handle.net/1721.1/139150
work_keys_str_mv AT chuangchingyao understandingandestimatingtheadaptabilityofdomaininvariantrepresentations