Representation Learning for Extrapolation via Bilinear Transduction

Typical machine learning systems, such as deep neural networks, perform well at predicting on new examples that come from the same distribution as initial training data. However, these systems are not typically robust to examples that do not come from the same distribution as the training samples. T...

Full description

Bibliographic Details
Main Author: Spiride, Andrei
Other Authors: Agrawal, Pulkit
Format: Thesis
Published: Massachusetts Institute of Technology 2024
Online Access:https://hdl.handle.net/1721.1/156798
_version_ 1826216880412557312
author Spiride, Andrei
author2 Agrawal, Pulkit
author_facet Agrawal, Pulkit
Spiride, Andrei
author_sort Spiride, Andrei
collection MIT
description Typical machine learning systems, such as deep neural networks, perform well at predicting on new examples that come from the same distribution as initial training data. However, these systems are not typically robust to examples that do not come from the same distribution as the training samples. These testing samples are characterized as out-of-distribution (OOD). Using a proven bilinear transduction [1] method for accurately predicting on OOD examples, we propose a method to apply this framework to learned representations instead of hand designed state representations. This work is geared towards enabling the bilinear transduction approach to generalize to a wider range of data types and tasks when such designed representations are not available. We use deep neural networks to learn representations of certain data types, such as images, and apply bilinear transduction to these learned representations. This has the potential to further expand the out-of-support prediction capabilities of the bilinear transduction framework.
first_indexed 2024-09-23T16:54:37Z
format Thesis
id mit-1721.1/156798
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T16:54:37Z
publishDate 2024
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1567982024-09-17T03:01:50Z Representation Learning for Extrapolation via Bilinear Transduction Spiride, Andrei Agrawal, Pulkit Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Typical machine learning systems, such as deep neural networks, perform well at predicting on new examples that come from the same distribution as initial training data. However, these systems are not typically robust to examples that do not come from the same distribution as the training samples. These testing samples are characterized as out-of-distribution (OOD). Using a proven bilinear transduction [1] method for accurately predicting on OOD examples, we propose a method to apply this framework to learned representations instead of hand designed state representations. This work is geared towards enabling the bilinear transduction approach to generalize to a wider range of data types and tasks when such designed representations are not available. We use deep neural networks to learn representations of certain data types, such as images, and apply bilinear transduction to these learned representations. This has the potential to further expand the out-of-support prediction capabilities of the bilinear transduction framework. M.Eng. 2024-09-16T13:49:55Z 2024-09-16T13:49:55Z 2024-05 2024-07-11T14:37:02.155Z Thesis https://hdl.handle.net/1721.1/156798 Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) Copyright retained by author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Spiride, Andrei
Representation Learning for Extrapolation via Bilinear Transduction
title Representation Learning for Extrapolation via Bilinear Transduction
title_full Representation Learning for Extrapolation via Bilinear Transduction
title_fullStr Representation Learning for Extrapolation via Bilinear Transduction
title_full_unstemmed Representation Learning for Extrapolation via Bilinear Transduction
title_short Representation Learning for Extrapolation via Bilinear Transduction
title_sort representation learning for extrapolation via bilinear transduction
url https://hdl.handle.net/1721.1/156798
work_keys_str_mv AT spirideandrei representationlearningforextrapolationviabilineartransduction