Robust modeling of differential gene expression data using normal/independent distributions: a Bayesian approach.

Mojtaba Ganjali, Taban Baghfalaki, Damon Berridge
Author Information
  1. Mojtaba Ganjali: School of Biological Science, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran; Department of Statistics, Faculty of Mathematical Sciences, Shahid Beheshti University, Tehran, Iran.
  2. Taban Baghfalaki: School of Biological Science, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran; Department of Statistics, Faculty of Mathematical Sciences, Tarbiat Modares University, Tehran, Iran.
  3. Damon Berridge: Farr Institute-CIPHER, College of Medicine, Swansea University, Swansea, Wales, U.K.

Abstract

This paper addresses the problem of identifying genes that are differentially expressed across conditions from gene expression microarray data in the presence of outliers. To this end, gene expression data are modeled robustly using a flexible family known as normal/independent distributions. This family includes the normal and Student's t distributions, which have been used previously, as well as extensions such as the slash, contaminated normal and Laplace distributions. The aim is to identify differentially expressed genes under these distributional assumptions rather than under normality alone. Parameters are estimated with a Bayesian approach using Markov chain Monte Carlo (MCMC) methods. Two publicly available gene expression data sets are analyzed with the proposed approach, and the use of the robust models for detecting differentially expressed genes is investigated. The investigation shows that the choice of model matters considerably when detecting differential expression, because each gene has only a small number of replicates and the data contain outlying observations. The performance of the models is compared using several statistical criteria and the ROC curve. The method is also illustrated with simulation studies. Overall, the results demonstrate the flexibility of these robust models in identifying differentially expressed genes.
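
To make the normal/independent family concrete, the following minimal Python sketch draws values using the usual scale-mixture representation Y = mu + sigma * Z / sqrt(U), where Z is standard normal and U is a positive mixing variable independent of Z. The function names, default parameter values and the specific parameterization of each mixing distribution are assumptions chosen for illustration; they are not taken from the paper, which may use a different parameterization, and the sketch covers only simulation, not the Bayesian MCMC estimation.

```python
import math
import random
import statistics


def mix_normal(rng):
    # Normal case: the mixing variable is fixed at 1.
    return 1.0


def mix_student_t(rng, nu=3.0):
    # Student's t: U ~ Gamma(shape=nu/2, rate=nu/2).
    # random.gammavariate takes a scale parameter, hence 2/nu.
    return rng.gammavariate(nu / 2.0, 2.0 / nu)


def mix_slash(rng, nu=2.0):
    # Slash: U ~ Beta(nu, 1), drawn by inverting its CDF u**nu.
    # 1 - rng.random() lies in (0, 1], so U is never zero.
    return (1.0 - rng.random()) ** (1.0 / nu)


def mix_contaminated(rng, prob=0.1, gamma=0.1):
    # Contaminated normal: with probability `prob` the variance is
    # inflated by 1/gamma (0 < gamma < 1); otherwise U = 1.
    return gamma if rng.random() < prob else 1.0


def mix_laplace(rng):
    # Laplace: the variance multiplier W = 1/U is exponential with mean 2.
    w = rng.expovariate(0.5)
    return 1.0 / w if w > 0.0 else math.inf


def draw_ni(mu, sigma, mix, rng):
    # One draw from the normal/independent model with location mu,
    # scale sigma and the given mixing-variable sampler.
    u = mix(rng)
    return mu + sigma * rng.gauss(0.0, 1.0) / math.sqrt(u)


if __name__ == "__main__":
    rng = random.Random(1)
    samplers = [
        ("normal", mix_normal),
        ("student_t", mix_student_t),
        ("slash", mix_slash),
        ("contaminated", mix_contaminated),
        ("laplace", mix_laplace),
    ]
    for name, mix in samplers:
        ys = [draw_ni(0.0, 1.0, mix, rng) for _ in range(20000)]
        # The median stays near mu; the largest |y| hints at tail weight.
        print(name, round(statistics.median(ys), 3),
              round(max(abs(y) for y in ys), 1))
```

Small values of U inflate the variance of an individual observation, which is how these models can accommodate an outlying replicate without shifting the location estimate as strongly as a normal model would.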

Grants

  1. MR/K006525/1/Medical Research Council

MeSH Terms

Algorithms
Bayes Theorem
Breast Neoplasms
Computational Biology
Gene Expression Profiling
Gene Expression Regulation
Humans
Leukemia
Models, Statistical
Normal Distribution
ROC Curve
