Statistical physics and biological information : hydrophobicity patterns in protein design and differential motif finding in DNA

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Physics, 2004.

Bibliographic Details
Main Author: Yahyanejad, Mehdi, 1975-
Other Authors: Christopher B. Burge and Mehran Kardar.
Format: Thesis
Language:en_US
Published: Massachusetts Institute of Technology 2005
Subjects:
Online Access:http://hdl.handle.net/1721.1/28647
_version_ 1826199026430640128
author Yahyanejad, Mehdi, 1975-
author2 Christopher B. Burge and Mehran Kardar.
author_facet Christopher B. Burge and Mehran Kardar.
Yahyanejad, Mehdi, 1975-
author_sort Yahyanejad, Mehdi, 1975-
collection MIT
description Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Physics, 2004.
first_indexed 2024-09-23T11:13:33Z
format Thesis
id mit-1721.1/28647
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T11:13:33Z
publishDate 2005
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/286472019-04-11T09:04:14Z Statistical physics and biological information : hydrophobicity patterns in protein design and differential motif finding in DNA Hydrophobicity patterns in protein design and differential motif finding in DNA Yahyanejad, Mehdi, 1975- Christopher B. Burge and Mehran Kardar. Massachusetts Institute of Technology. Dept. of Physics. Massachusetts Institute of Technology. Dept. of Physics. Physics. Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Physics, 2004. Includes bibliographical references (p. 115-124). (cont.) is dictated by the solvent accessibility of structures. The distinct intrinsic tendencies of sequence and structure profiles are most pronounced at long periods, where sequence hydrophobicity fluctuates less, while solvent accessibility fluctuates more than average. Correlations between the two profiles can be interpreted as the Boltzmann weight of the solvation energy at room temperature. Chapter 4 shows that correlations in solvent accessibility along protein structures play a key role in the designability phenomenon, for both lattice and natural proteins. Without such correlations, as predicted by the Random Energy Model (REM), all structures will have almost equal values of designability. By using a toy, Ising-based model, we show that changing the correlations moves between a regime with no designability and a regime exhibiting the designability phenomenon, where a few highly designable structures emerge. Understanding how gene expression is regulated is one of the main goals of molecular cell biology. To reach this goal, the recognition and identification of DNA motifs--short patterns in biological sequences--is essential. Common examples of motifs include transcription factor binding sites in promoter regions of co-regulated genes and exonic and intronic splicing enhancers ... In the past decade, a large amount of biological data has been generated, enabling new quantitative approaches in biology. In this thesis, we focus on two biological questions by using techniques from statistical physics: hydrophobicity patterns in proteins and their impact on the designability of protein structures and regulatory motif finding in DNA sequences. Proteins fold into specific structures to perform their functions. Hydrophobicity is the main force of folding; protein sequences try to lower the ground state energy of the folded structure by burying hydrophobic monomers in the core. This results in patterns, or correlations, in the hydrophobic profiles of proteins. In this thesis, we study the designability phenomena: the vast majority of proteins adopt only a small number of distinct folded structures. In Chapter 2, we use principal component analysis to characterize the distribution of solvent accessibility profiles in an appropriate high-dimensional vector space and show that the distribution can be approximated with a Gaussian form. We also show that structures with solvent accessibility profiles dissimilar to the rest are more likely to be highly designable, offering an alternative to existing, computationally-intensive methods for identifying highly-designable structures. In Chapter 3, we extend our method to natural proteins. We use Fourier analysis to study the solvent accessibility and hydrophobicity profiles of natural proteins and show that their distribution can be approximated by a multi-variate Gaussian. The method allows us to separate the intrinsic tendencies of sequence and structure profiles from the interactions that correlate them; we conclude that the alpha-helix periodicity in sequence hydrophobicity by Mehdi Yahyanejad. Ph.D. 2005-09-27T17:29:56Z 2005-09-27T17:29:56Z 2004 2004 Thesis http://hdl.handle.net/1721.1/28647 58965031 en_US M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 124 p. 4734447 bytes 4750021 bytes application/pdf application/pdf application/pdf Massachusetts Institute of Technology
spellingShingle Physics.
Yahyanejad, Mehdi, 1975-
Statistical physics and biological information : hydrophobicity patterns in protein design and differential motif finding in DNA
title Statistical physics and biological information : hydrophobicity patterns in protein design and differential motif finding in DNA
title_full Statistical physics and biological information : hydrophobicity patterns in protein design and differential motif finding in DNA
title_fullStr Statistical physics and biological information : hydrophobicity patterns in protein design and differential motif finding in DNA
title_full_unstemmed Statistical physics and biological information : hydrophobicity patterns in protein design and differential motif finding in DNA
title_short Statistical physics and biological information : hydrophobicity patterns in protein design and differential motif finding in DNA
title_sort statistical physics and biological information hydrophobicity patterns in protein design and differential motif finding in dna
topic Physics.
url http://hdl.handle.net/1721.1/28647
work_keys_str_mv AT yahyanejadmehdi1975 statisticalphysicsandbiologicalinformationhydrophobicitypatternsinproteindesignanddifferentialmotiffindingindna
AT yahyanejadmehdi1975 hydrophobicitypatternsinproteindesignanddifferentialmotiffindingindna