Comparison of methods for competitive tests of pathway analysis.

Marina Evangelou, Augusto Rendon, Willem H Ouwehand, Lorenz Wernisch, Frank Dudbridge
Author Information
  1. Marina Evangelou: Medical Research Council Biostatistics Unit, Institute of Public Health, Cambridge, United Kingdom.

Abstract

It has been suggested that pathway analysis can complement single-SNP analysis in exploring genomewide association data. Pathway analysis incorporates the available biological knowledge of genes and SNPs and is expected to improve the chances of revealing the underlying genetic architecture of complex traits. Methods for pathway analysis can be classified as competitive (enrichment) or self-contained (association) according to the hypothesis tested. Although association tests are statistically more powerful than enrichment tests they can be difficult to calibrate because biases in analysis accumulate across multiple SNPs or genes. Furthermore, enrichment tests can be more scientifically relevant than association tests, as they detect pathways with relatively more evidence for association than the remaining genes. Here we show how some well known association tests can be simply adapted to test for enrichment, and compare their performance to some established enrichment tests. We propose versions of the Adaptive Rank Truncated Product (ARTP), Tail Strength Measure and Fisher's combination of p-values for testing the enrichment null hypothesis. We compare the behaviour of these proposed methods with the established Hypergeometric Test and Gene-Set Enrichment Analysis (GSEA). The results of the simulation study show that the modified version of the ARTP method has generally the best performance across the situations considered. The methods were also applied for finding enriched pathways for body mass index (BMI) and platelet function phenotypes. The pathway analysis of BMI identified the Vasoactive Intestinal Peptide pathway as significantly associated with BMI. This pathway has been previously reported as associated with BMI and the risk of obesity. The ARTP method was the method that identified the largest number of enriched pathways across all tested pathway databases and phenotypes. The simulation and data application results are in agreement with previous work on association tests and suggests that the ARTP should be preferred for both enrichment and association testing.

References

  1. Br J Cancer. 1999 Jul;80 Suppl 1:95-103 [PMID: 10466767]
  2. PLoS Genet. 2005 Sep;1(3):e32 [PMID: 16151517]
  3. BMC Proc. 2009 Dec 15;3 Suppl 7:S96 [PMID: 20018093]
  4. Genet Epidemiol. 2009 Jul;33(5):419-31 [PMID: 19235186]
  5. Biostatistics. 2006 Apr;7(2):167-81 [PMID: 16332926]
  6. Am J Hum Genet. 2009 Jul;85(1):13-24 [PMID: 19539887]
  7. Nucleic Acids Res. 2000 Jan 1;28(1):27-30 [PMID: 10592173]
  8. Nucleic Acids Res. 2010 Jul;38(Web Server issue):W749-54 [PMID: 20501604]
  9. J Thromb Haemost. 2007 Aug;5(8):1756-65 [PMID: 17663743]
  10. Genet Epidemiol. 2003 Dec;25(4):360-6 [PMID: 14639705]
  11. Obesity (Silver Spring). 2010 Dec;18(12):2339-46 [PMID: 20379146]
  12. Bioinformatics. 2009 Oct 15;25(20):2762-3 [PMID: 19620097]
  13. Genet Epidemiol. 2009 May;33(4):290-8 [PMID: 19051284]
  14. Am J Hum Genet. 2007 Dec;81(6):1278-83 [PMID: 17966091]
  15. Nucleic Acids Res. 2011 Jan;39(Database issue):D691-7 [PMID: 21067998]
  16. Genet Epidemiol. 2009 Sep;33(6):497-507 [PMID: 19170135]
  17. Genet Epidemiol. 2008 Nov;32(7):658-68 [PMID: 18481796]
  18. Blood. 2009 Aug 13;114(7):1405-16 [PMID: 19429868]
  19. Bioinformatics. 2009 Jan 15;25(2):237-42 [PMID: 19029127]
  20. Nat Rev Genet. 2010 Dec;11(12):843-54 [PMID: 21085203]
  21. Bioinformatics. 2007 Apr 15;23(8):980-7 [PMID: 17303618]
  22. Genet Epidemiol. 2008 Sep;32(6):560-6 [PMID: 18428428]
  23. Biometrika. 1949 Dec;36(3-4):370-82 [PMID: 15402072]
  24. Bioinformatics. 2004 Jan 1;20(1):93-9 [PMID: 14693814]
  25. PLoS One. 2010 Sep 17;5(9): [PMID: 20862301]
  26. Genet Epidemiol. 2009 Dec;33(8):700-9 [PMID: 19333968]
  27. Nucleic Acids Res. 2009 Jan;37(Database issue):D619-22 [PMID: 18981052]

Grants

  1. G1000718/Medical Research Council
  2. MC_U105260799/Medical Research Council
  3. RG/09/012/28096/British Heart Foundation
  4. RP-PG-0310-1002/Department of Health

MeSH Term

Algorithms
Blood Platelets
Body Mass Index
Computer Simulation
Genome-Wide Association Study
Humans
Metabolic Networks and Pathways
Models, Genetic
Molecular Sequence Annotation
Phenotype
Polymorphism, Single Nucleotide

Word Cloud

Created with Highcharts 10.0.0associationtestspathwayanalysisenrichmentcanARTPBMIgenesacrosspathwaysmethodsmethoddataSNPscompetitivehypothesistestedshowcompareperformanceestablishedtestingresultssimulationenrichedphenotypesidentifiedassociatedsuggestedcomplementsingle-SNPexploringgenomewidePathwayincorporatesavailablebiologicalknowledgeexpectedimprovechancesrevealingunderlyinggeneticarchitecturecomplextraitsMethodsclassifiedself-containedaccordingAlthoughstatisticallypowerfuldifficultcalibratebiasesaccumulatemultipleFurthermorescientificallyrelevantdetectrelativelyevidenceremainingwellknownsimplyadaptedtestproposeversionsAdaptiveRankTruncatedProductTailStrengthMeasureFisher'scombinationp-valuesnullbehaviourproposedHypergeometricTestGene-SetEnrichmentAnalysisGSEAstudymodifiedversiongenerallybestsituationsconsideredalsoappliedfindingbodymassindexplateletfunctionVasoactiveIntestinalPeptidesignificantlypreviouslyreportedriskobesitylargestnumberdatabasesapplicationagreementpreviousworksuggestspreferredComparison

Similar Articles

Cited By