Algorithms for the analysis of protein interaction networks
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2012.
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | eng |
Published: |
Massachusetts Institute of Technology
2012
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/71489 |
_version_ | 1826206598895239168 |
---|---|
author | Singh, Rohit, Ph.D. Massachusetts Institute of Technology |
author2 | Bonnie Berger. |
author_facet | Bonnie Berger. Singh, Rohit, Ph.D. Massachusetts Institute of Technology |
author_sort | Singh, Rohit, Ph.D. Massachusetts Institute of Technology |
collection | MIT |
description | Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2012. |
first_indexed | 2024-09-23T13:35:07Z |
format | Thesis |
id | mit-1721.1/71489 |
institution | Massachusetts Institute of Technology |
language | eng |
last_indexed | 2024-09-23T13:35:07Z |
publishDate | 2012 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/714892019-04-10T18:11:44Z Algorithms for the analysis of protein interaction networks Singh, Rohit, Ph.D. Massachusetts Institute of Technology Bonnie Berger. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2012. Cataloged from PDF version of thesis. Includes bibliographical references (p. 107-117). In the decade since the human genome project, a major research trend in biology has been towards understanding the cell as a system. This interest has stemmed partly from a deeper appreciation of how important it is to understand the emergent properties of cellular systems (e.g., they seem to be the key to understanding diseases like cancer). It has also been enabled by new high-throughput techniques that have allowed us to collect new types of data at the whole-genome scale. We focus on one sub-domain of systems biology: the understanding of protein interactions. Such understanding is valuable: interactions between proteins are fundamental to many cellular processes. Over the last decade, high-throughput experimental techniques have allowed us to collect a large amount of protein-protein interaction (PPI) data for many species. A popular abstraction for representing this data is the protein interaction network: each node of the network represents a protein and an edge between two nodes represents a physical interaction between the two corresponding proteins. This abstraction has proven to be a powerful tool for understanding the systems aspects of protein interaction. We present some algorithms for the augmentation, cleanup and analysis of such protein interaction networks: 1. In many species, the coverage of known PPI data remains partial. Given two protein sequences, we describe an algorithm to predict if two proteins physically interact, using logistic regression and insights from structural biology. We also describe how our predictions may be further improved by combining with functional-genomic data. 2. We study systematic false positives in a popular experimental protocol, the Yeast 2-Hybrid method. Here, some "promiscuous" proteins may lead to many false positives. We describe a Bayesian approach to modeling and adjusting for this error. 3. Comparative analysis of PPI networks across species can provide valuable insights. We describe IsoRank, an algorithm for global network alignment of multiple PPI networks. The algorithm first constructs an eigenvalue problem that encapsulates the network and sequence similarity constraints. The solution of the problem describes a k-partite graph that is further processed to find the alignment. 4. For a given signaling network, we describe an algorithm that combines RNA-interference data with PPI data to produce hypotheses about the structure of the signaling network. Our algorithm constructs a multi-commodity flow problem that expresses the constraints described by the data and finds a sparse solution to it. by Rohit Singh. Ph.D. 2012-07-02T15:47:26Z 2012-07-02T15:47:26Z 2012 2012 Thesis http://hdl.handle.net/1721.1/71489 795580316 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 117 p. application/pdf Massachusetts Institute of Technology |
spellingShingle | Electrical Engineering and Computer Science. Singh, Rohit, Ph.D. Massachusetts Institute of Technology Algorithms for the analysis of protein interaction networks |
title | Algorithms for the analysis of protein interaction networks |
title_full | Algorithms for the analysis of protein interaction networks |
title_fullStr | Algorithms for the analysis of protein interaction networks |
title_full_unstemmed | Algorithms for the analysis of protein interaction networks |
title_short | Algorithms for the analysis of protein interaction networks |
title_sort | algorithms for the analysis of protein interaction networks |
topic | Electrical Engineering and Computer Science. |
url | http://hdl.handle.net/1721.1/71489 |
work_keys_str_mv | AT singhrohitphdmassachusettsinstituteoftechnology algorithmsfortheanalysisofproteininteractionnetworks |