Towards Deep Learning Models of Metabolism

Enzymes play a critical role in catalyzing the chemical reactions that underpin metabolic processes in living organisms. Despite their importance, a vast majority of enzymes remain uncharacterized, limiting our understanding of their potential roles in metabolism and disease. This thesis aims to add...

Full description

Bibliographic Details
Main Author: Chinn, Itamar
Other Authors: Barzilay, Regina
Format: Thesis
Published: Massachusetts Institute of Technology 2024
Online Access:https://hdl.handle.net/1721.1/156153
_version_ 1811085836235571200
author Chinn, Itamar
author2 Barzilay, Regina
author_facet Barzilay, Regina
Chinn, Itamar
author_sort Chinn, Itamar
collection MIT
description Enzymes play a critical role in catalyzing the chemical reactions that underpin metabolic processes in living organisms. Despite their importance, a vast majority of enzymes remain uncharacterized, limiting our understanding of their potential roles in metabolism and disease. This thesis aims to address this gap by leveraging recent advancements in protein and molecular modeling to predict the outcomes of enzymatic reactions and identify functions of unannotated enzymes. Two key contributions are highlighted. Firstly, a graph-based forward synthesis prediction model is introduced, which relies only on the molecular structure of the substrates and the enzyme’s primary sequence. By capturing the biochemical interaction between enzyme residues and substrate atoms, the model achieves better generalization to new chemistry, demonstrating significant improvements in predicting unseen products and showcasing its potential for drug metabolism prediction. The second contribution is CLIPZyme, a contrastive learning method for virtual enzyme screening that frames the task of identifying enzymes catalyzing a reaction of interest as a retrieval problem. CLIPZyme outperforms the baseline approach of screening enzymes via their enzyme commission (EC) number. The combination of CLIPZyme with EC prediction consistently yields improved results over either method alone. Both of these contributions aim to provide the initial building blocks to model entire complex metabolic networks with downstream applications including metabolic engineering and drug discovery.
first_indexed 2024-09-23T13:16:24Z
format Thesis
id mit-1721.1/156153
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T13:16:24Z
publishDate 2024
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1561532024-08-15T03:27:13Z Towards Deep Learning Models of Metabolism Chinn, Itamar Barzilay, Regina Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Enzymes play a critical role in catalyzing the chemical reactions that underpin metabolic processes in living organisms. Despite their importance, a vast majority of enzymes remain uncharacterized, limiting our understanding of their potential roles in metabolism and disease. This thesis aims to address this gap by leveraging recent advancements in protein and molecular modeling to predict the outcomes of enzymatic reactions and identify functions of unannotated enzymes. Two key contributions are highlighted. Firstly, a graph-based forward synthesis prediction model is introduced, which relies only on the molecular structure of the substrates and the enzyme’s primary sequence. By capturing the biochemical interaction between enzyme residues and substrate atoms, the model achieves better generalization to new chemistry, demonstrating significant improvements in predicting unseen products and showcasing its potential for drug metabolism prediction. The second contribution is CLIPZyme, a contrastive learning method for virtual enzyme screening that frames the task of identifying enzymes catalyzing a reaction of interest as a retrieval problem. CLIPZyme outperforms the baseline approach of screening enzymes via their enzyme commission (EC) number. The combination of CLIPZyme with EC prediction consistently yields improved results over either method alone. Both of these contributions aim to provide the initial building blocks to model entire complex metabolic networks with downstream applications including metabolic engineering and drug discovery. S.M. 2024-08-14T20:11:28Z 2024-08-14T20:11:28Z 2024-05 2024-07-10T12:59:30.770Z Thesis https://hdl.handle.net/1721.1/156153 Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) Copyright retained by author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Chinn, Itamar
Towards Deep Learning Models of Metabolism
title Towards Deep Learning Models of Metabolism
title_full Towards Deep Learning Models of Metabolism
title_fullStr Towards Deep Learning Models of Metabolism
title_full_unstemmed Towards Deep Learning Models of Metabolism
title_short Towards Deep Learning Models of Metabolism
title_sort towards deep learning models of metabolism
url https://hdl.handle.net/1721.1/156153
work_keys_str_mv AT chinnitamar towardsdeeplearningmodelsofmetabolism