Inferring Gene Regulatory Networks from Multiple Datasets.

Christopher A Penfold, Iulia Gherman, Anastasiya Sybirna, David L Wild
Author Information
  1. Christopher A Penfold: Wellcome/CRUK Gurdon Institute, University of Cambridge, Cambridge, UK. cap76@cam.ac.uk.
  2. Iulia Gherman: Warwick Integrative Synthetic Biology Centre, School of Engineering, University of Warwick, Coventry, UK.
  3. Anastasiya Sybirna: Wellcome/CRUK Gurdon Institute, University of Cambridge, Cambridge, UK.
  4. David L Wild: Department of Statistics and Systems Biology Centre, University of Warwick, Coventry, UK.

Abstract

Gaussian process dynamical systems (GPDS) represent Bayesian nonparametric approaches to inference of nonlinear dynamical systems, and provide a principled framework for the learning of biological networks from multiple perturbed time series measurements of gene or protein expression. Such approaches are able to capture the full richness of complex ODE models, and can be scaled for inference in moderately large systems containing hundreds of genes. Related hierarchical approaches allow for inference from multiple datasets in which the underlying generative networks are assumed to have been rewired, either by context-dependent changes in network structure, evolutionary processes, or synthetic manipulation. These approaches can also be used to leverage experimentally determined network structures from one species into another where the network structure is unknown. Collectively, these methods provide a comprehensive and flexible platform for inference from a diverse range of data, with applications in systems and synthetic biology, as well as spatiotemporal modelling of embryo development. In this chapter we provide an overview of GPDS approaches and highlight their applications in the biological sciences, with accompanying tutorials available as a Jupyter notebook from https://github.com/cap76/GPDS .

Keywords

Grants

  1. MC_PC_12009/Medical Research Council
  2. BB/L014130/1/Biotechnology and Biological Sciences Research Council
  3. 083089/Z/07/Z/Wellcome Trust

MeSH Term

Algorithms
Bayes Theorem
Datasets as Topic
Gene Expression Profiling
Gene Regulatory Networks
Models, Genetic
Normal Distribution
Spatio-Temporal Analysis
Systems Biology

Word Cloud

Created with Highcharts 10.0.0systemsapproachesdynamicalinferenceprovidemultiplenetworkstructureGaussianprocessGPDSbiologicalnetworksmodelscansyntheticdataapplicationsrepresentBayesiannonparametricnonlinearprincipledframeworklearningperturbedtimeseriesmeasurementsgeneproteinexpressionablecapturefullrichnesscomplexODEscaledmoderatelylargecontaininghundredsgenesRelatedhierarchicalallowdatasetsunderlyinggenerativeassumedrewiredeithercontext-dependentchangesevolutionaryprocessesmanipulationalsousedleverageexperimentallydeterminedstructuresonespeciesanotherunknownCollectivelymethodscomprehensiveflexibleplatformdiverserangebiologywellspatiotemporalmodellingembryodevelopmentchapteroverviewhighlightsciencesaccompanyingtutorialsavailableJupyternotebookhttps://githubcom/cap76/GPDSInferringGeneRegulatoryNetworksMultipleDatasetsCausalidentificationLearningsourcesNonlinearSpatiotemporal

Similar Articles

Cited By