MetSizeR: selecting the optimal sample size for metabolomic studies using an analysis based approach.

Gift Nyamundanda, Isobel Claire Gormley, Yue Fan, William M Gallagher, Lorraine Brennan
Author Information
  1. Gift Nyamundanda: School of Mathematical Sciences, University College Dublin, Dublin, Ireland. claire.gormley@ucd.ie.

Abstract

BACKGROUND: Determining sample sizes for metabolomic experiments is important but due to the complexity of these experiments, there are currently no standard methods for sample size estimation in metabolomics. Since pilot studies are rarely done in metabolomics, currently existing sample size estimation approaches which rely on pilot data can not be applied.
RESULTS: In this article, an analysis based approach called MetSizeR is developed to estimate sample size for metabolomic experiments even when experimental pilot data are not available. The key motivation for MetSizeR is that it considers the type of analysis the researcher intends to use for data analysis when estimating sample size. MetSizeR uses information about the data analysis technique and prior expert knowledge of the metabolomic experiment to simulate pilot data from a statistical model. Permutation based techniques are then applied to the simulated pilot data to estimate the required sample size.
CONCLUSIONS: The MetSizeR methodology, and a publicly available software package which implements the approach, are illustrated through real metabolomic applications. Sample size estimates, informed by the intended statistical analysis technique, and the associated uncertainty are provided.

References

  1. Drug Chem Toxicol. 2002 Nov;25(4):375-82 [PMID: 12378948]
  2. Neurochem Int. 2010 Jan;56(2):340-4 [PMID: 19913064]
  3. BMC Bioinformatics. 2010 Nov 23;11:571 [PMID: 21092268]
  4. Bioinformatics. 2002 Sep;18(9):1184-93 [PMID: 12217910]
  5. Nat Rev Mol Cell Biol. 2012 Mar 22;13(4):263-9 [PMID: 22436749]
  6. Mass Spectrom Rev. 2007 Jan-Feb;26(1):51-78 [PMID: 16921475]
  7. BMC Bioinformatics. 2006 Mar 02;7:106 [PMID: 16512900]
  8. Bioinformatics. 2011 Jul 15;27(14):1979-85 [PMID: 21729866]
  9. Bioinformatics. 2007 Mar 15;23(6):739-46 [PMID: 17237060]
  10. BMC Bioinformatics. 2010 Jan 25;11:48 [PMID: 20100337]

MeSH Term

Algorithms
Animals
Computer Simulation
Longitudinal Studies
Metabolomics
Models, Statistical
Nuclear Magnetic Resonance, Biomolecular
Pilot Projects
Principal Component Analysis
Sample Size
Software

Word Cloud

Created with Highcharts 10.0.0samplesizedataanalysismetabolomicpilotMetSizeRexperimentsbasedapproachcurrentlyestimationmetabolomicsstudiesappliedestimateavailabletechniquestatisticalBACKGROUND:DeterminingsizesimportantduecomplexitystandardmethodsSincerarelydoneexistingapproachesrelycanRESULTS:articlecalleddevelopedevenexperimentalkeymotivationconsiderstyperesearcherintendsuseestimatingusesinformationpriorexpertknowledgeexperimentsimulatemodelPermutationtechniquessimulatedrequiredCONCLUSIONS:methodologypubliclysoftwarepackageimplementsillustratedrealapplicationsSampleestimatesinformedintendedassociateduncertaintyprovidedMetSizeR:selectingoptimalusing

Similar Articles

Cited By