ASSIGN: context-specific genomic profiling of multiple heterogeneous biological pathways.

Ying Shen, Mumtahena Rahman, Stephen R Piccolo, Daniel Gusenleitner, Nader N El-Chaar, Luis Cheng, Stefano Monti, Andrea H Bild, W Evan Johnson
Author Information
  1. Ying Shen: Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA.
  2. Mumtahena Rahman: Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA.
  3. Stephen R Piccolo: Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA.
  4. Daniel Gusenleitner: Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA.
  5. Nader N El-Chaar: Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA.
  6. Luis Cheng: Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA.
  7. Stefano Monti: Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA.
  8. Andrea H Bild: Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA.
  9. W Evan Johnson: Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118 USA, Department of Biomedical Informatics and Department of Pharmacology and Toxicology, University of Utah, Salt Lake City, UT 84112 USA.

Abstract

MOTIVATION: Although gene-expression signature-based biomarkers are often developed for clinical diagnosis, many promising signatures fail to replicate during validation. One major challenge is that biological samples used to generate and validate the signature are often from heterogeneous biological contexts-controlled or in vitro samples may be used to generate the signature, but patient samples may be used for validation. In addition, systematic technical biases from multiple genome-profiling platforms often mask true biological variation. Addressing such challenges will enable us to better elucidate disease mechanisms and provide improved guidance for personalized therapeutics.
RESULTS: Here, we present a pathway profiling toolkit, Adaptive Signature Selection and InteGratioN (ASSIGN), which enables robust and context-specific pathway analyses by efficiently capturing pathway activity in heterogeneous sets of samples and across profiling technologies. The ASSIGN framework is based on a flexible Bayesian factor analysis approach that allows for simultaneous profiling of multiple correlated pathways and for the adaptation of pathway signatures into specific disease. We demonstrate the robustness and versatility of ASSIGN in estimating pathway activity in simulated data, cell lines perturbed pathways and in primary tissues samples including The Cancer Genome Atlas breast carcinoma samples and liver samples exposed to genotoxic carcinogens.
AVAILABILITY AND IMPLEMENTATION: Software for our approach is available for download at: http://www.bioconductor.org/packages/release/bioc/html/ASSIGN.html and https://github.com/wevanjohnson/ASSIGN.

References

  1. Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50 [PMID: 16199517]
  2. Proc Natl Acad Sci U S A. 2013 Oct 29;110(44):17778-83 [PMID: 24128763]
  3. Mol Syst Biol. 2013;9:673 [PMID: 23752269]
  4. Bioinformatics. 2007 Oct 1;23(19):2507-17 [PMID: 17720704]
  5. BMC Genomics. 2008;9 Suppl 1:S13 [PMID: 18366602]
  6. Mutat Res. 2000 Nov 20;455(1-2):29-60 [PMID: 11113466]
  7. Bioinformatics. 2009 May 1;25(9):1105-11 [PMID: 19289445]
  8. Nat Rev Mol Cell Biol. 2011 Feb;12(2):104-17 [PMID: 21252999]
  9. J Biotechnol. 2005 Sep 29;119(3):219-44 [PMID: 16005536]
  10. Biostatistics. 2010 Apr;11(2):242-53 [PMID: 20097884]
  11. Nat Biotechnol. 2010 May;28(5):511-5 [PMID: 20436464]
  12. Int J Cancer. 2010 Mar 1;126(5):1121-31 [PMID: 19685490]
  13. Nature. 2009 Nov 5;462(7269):108-12 [PMID: 19847166]
  14. Sci Transl Med. 2010 Apr 7;2(26):26ra25 [PMID: 20375364]
  15. Biometrika. 2011 Jun;98(2):291-306 [PMID: 23049129]
  16. Bioinformatics. 2012 Oct 15;28(20):2662-70 [PMID: 22923307]
  17. Nucleic Acids Res. 2010 Oct;38(18):e178 [PMID: 20802226]
  18. Bioinformatics. 2011 Jun 15;27(12):1739-40 [PMID: 21546393]
  19. BMC Bioinformatics. 2011;12:323 [PMID: 21816040]
  20. Clin Cancer Res. 2009 Jul 15;15(14):4649-64 [PMID: 19567590]
  21. Biostatistics. 2007 Jan;8(1):118-27 [PMID: 16632515]
  22. BMC Bioinformatics. 2013;14:99 [PMID: 23506672]
  23. Nature. 2006 Jan 19;439(7074):353-7 [PMID: 16273092]
  24. Med Ref Serv Q. 2008 Fall;27(3):303-11 [PMID: 19042710]
  25. Nature. 2012 Oct 4;490(7418):61-70 [PMID: 23000897]
  26. Nat Genet. 2005 Jan;37(1):48-55 [PMID: 15608639]
  27. Breast Cancer Res. 2013;15(1):R16 [PMID: 23448424]
  28. Genomics. 2012 Dec;100(6):337-44 [PMID: 22959562]
  29. Pharmacogenomics. 2002 May;3(3):403-15 [PMID: 12052147]
  30. N Engl J Med. 2002 Dec 19;347(25):1999-2009 [PMID: 12490681]
  31. Nat Genet. 2000 May;25(1):25-9 [PMID: 10802651]
  32. Science. 2002 Jul 5;297(5578):63-4 [PMID: 12098689]
  33. Clin Cancer Res. 2008 Mar 1;14(5):1368-76 [PMID: 18316557]
  34. Science. 1999 Oct 15;286(5439):531-7 [PMID: 10521349]
  35. Nucleic Acids Res. 2014 Jan;42(Database issue):D199-205 [PMID: 24214961]

Grants

  1. T15 LM007124/NLM NIH HHS
  2. U01 CA164720/NCI NIH HHS
  3. T15LM007124/NLM NIH HHS
  4. U01CA164720/NCI NIH HHS

MeSH Term

Animals
Bayes Theorem
Breast Neoplasms
Female
Gene Expression Profiling
Genomics
Humans
Rats
Signal Transduction
Software