Thermodynamic state ensemble models of cis-regulation.

Marc S Sherman, Barak A Cohen
Author Information
  1. Marc S Sherman: Computational and Molecular Biophysics, Washington University in St. Louis, St. Louis, Missouri, United States of America.

Abstract

A major goal in computational biology is to develop models that accurately predict a gene's expression from its surrounding regulatory DNA. Here we present one class of such models, thermodynamic state ensemble models. We describe the biochemical derivation of the thermodynamic framework in simple terms, and lay out the mathematical components that comprise each model. These components include (1) the possible states of a promoter, where a state is defined as a particular arrangement of transcription factors bound to a DNA promoter, (2) the binding constants that describe the affinity of the protein-protein and protein-DNA interactions that occur in each state, and (3) whether each state is capable of transcribing. Using these components, we demonstrate how to compute a cis-regulatory function that encodes the probability of a promoter being active. Our intention is to provide enough detail so that readers with little background in thermodynamics can compose their own cis-regulatory functions. To facilitate this goal, we also describe a matrix form of the model that can be easily coded in any programming language. This formalism has great flexibility, which we show by illustrating how phenomena such as competition between transcription factors and cooperativity are readily incorporated into these models. Using this framework, we also demonstrate that Michaelis-like functions, another class of cis-regulatory models, are a subset of the thermodynamic framework with specific assumptions. By recasting Michaelis-like functions as thermodynamic functions, we emphasize the relationship between these models and delineate the specific circumstances representable by each approach. Application of thermodynamic state ensemble models is likely to be an important tool in unraveling the physical basis of combinatorial cis-regulation and in generating formalisms that accurately predict gene expression from DNA sequence.

References

Curr Pharm Des. 2007;13(14):1415-36 [PMID: 17504165]
J Natl Cancer Inst. 1999 Aug 4;91(15):1288-94 [PMID: 10433617]
J Mol Biol. 1985 Jan 20;181(2):211-30 [PMID: 3157005]
J Mol Biol. 2002 Nov 8;323(5):785-93 [PMID: 12417193]
Biophys J. 2004 Apr;86(4):1922-45 [PMID: 15041638]
Trends Biochem Sci. 2006 Feb;31(2):89-97 [PMID: 16403636]
Proc Natl Acad Sci U S A. 2003 Oct 14;100(21):11980-5 [PMID: 14530388]
Nat Genet. 2004 Dec;36(12):1331-9 [PMID: 15543148]
Cell. 2003 Dec 12;115(6):751-63 [PMID: 14675539]
Nature. 2009 Jan 8;457(7226):215-8 [PMID: 19029883]
Proc Natl Acad Sci U S A. 2002 Aug 6;99(16):10555-60 [PMID: 12145321]
PLoS Biol. 2009 Mar 31;7(3):e73 [PMID: 19338389]
Curr Opin Genet Dev. 2005 Apr;15(2):116-24 [PMID: 15797194]
Proc Natl Acad Sci U S A. 2003 Apr 29;100(9):5136-41 [PMID: 12702751]
Annu Rev Microbiol. 2003;57:441-66 [PMID: 14527287]
Bioinformatics. 2006 Jul 15;22(14):e141-9 [PMID: 16873464]
Proc Natl Acad Sci U S A. 2010 Sep 21;107(38):16743-8 [PMID: 20810924]
Nature. 2007 Jun 14;447(7146):799-816 [PMID: 17571346]
Nat Protoc. 2009;4(3):393-411 [PMID: 19265799]
Mol Syst Biol. 2010;6:341 [PMID: 20087339]
Mol Cell. 2008 May 23;30(4):486-97 [PMID: 18498750]
Theor Popul Biol. 2010 Feb;77(1):1-5 [PMID: 19818800]
Curr Biol. 2006 Jul 11;16(13):1358-65 [PMID: 16750631]
Curr Biol. 2003 Aug 19;13(16):1409-13 [PMID: 12932324]
Nature. 2008 Jan 31;451(7178):535-40 [PMID: 18172436]
Annu Rev Biophys. 2010;39:43-59 [PMID: 20192769]
Curr Opin Genet Dev. 2005 Apr;15(2):125-35 [PMID: 15797195]

Grants

  1. R01 GM078222/NIGMS NIH HHS
  2. R01 GM092910/NIGMS NIH HHS

MeSH Term

Computer Simulation
DNA
Gene Expression Regulation
Models, Chemical
Models, Genetic
Regulatory Sequences, Nucleic Acid
Thermodynamics
Transcription Factors
Transcriptional Activation

Chemicals

Transcription Factors
DNA

Word Cloud

Similar Articles

Cited By