Differential motif enrichment analysis of paired ChIP-seq experiments.

Tom Lesluyes, James Johnson, Philip Machanick, Timothy L Bailey
Author Information
  1. Timothy L Bailey: Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, 4072 Brisbane, Australia. t.bailey@imb.uq.edu.au.

Abstract

BACKGROUND: Motif enrichment analysis of transcription factor ChIP-seq data can help identify transcription factors that cooperate or compete. Previously, little attention has been given to comparative motif enrichment analysis of pairs of ChIP-seq experiments, where the binding of the same transcription factor is assayed under different conditions. Such comparative analysis could potentially identify the distinct regulatory partners/competitors of the assayed transcription factor under different conditions or at different stages of development.
RESULTS: We describe a new methodology for identifying sequence motifs that are differentially enriched in one set of DNA or RNA sequences relative to another set, and apply it to paired ChIP-seq experiments. We show that, using paired ChIP-seq data for a single transcription factor, differential motif enrichment analysis identifies all the known key transcription factors involved in the transformation of non-cancerous immortalized breast cells (MCF10A-ER-Src cells) into cancer stem cells whereas non-differential motif enrichment analysis does not. We also show that differential motif enrichment analysis identifies regulatory motifs that are significantly enriched at constrained locations within the bound promoters, and that these motifs are not identified by non-differential motif enrichment analysis. Our methodology differs from other approaches in that it leverages both comparative enrichment and positional enrichment of motifs in ChIP-seq peak regions or in the promoters of genes bound by the transcription factor.
CONCLUSIONS: We show that differential motif enrichment analysis of paired ChIP-seq experiments offers biological insights not available from non-differential analysis. In contrast to previous approaches, our method detects motifs that are enriched in a constrained region in one set of sequences, but not enriched in the same region in the comparative set. We have enhanced the web-based CentriMo algorithm to allow it to perform the constrained differential motif enrichment analysis described in this paper, and CentriMo's on-line interface (http://meme.ebi.edu.au) provides dozens of databases of DNA- and RNA-binding motifs from a full range of organisms. All data and output files presented here are available at http://research.imb.uq.edu.au/t.bailey/supplementary\_data/Lesluyes2014.

References

  1. Bioinformatics. 2000 Jan;16(1):16-23 [PMID: 10812473]
  2. Nucleic Acids Res. 2012 Sep 1;40(17):e128 [PMID: 22610855]
  3. Breast Cancer Res. 2012 Apr 18;14(2):R63 [PMID: 22513257]
  4. Nucleic Acids Res. 2014 Jan;42(Database issue):D142-7 [PMID: 24194598]
  5. BMC Bioinformatics. 2010 Apr 01;11:165 [PMID: 20356413]
  6. Proc Natl Acad Sci U S A. 1991 May 1;88(9):3720-4 [PMID: 1827203]
  7. Cancer Res. 1998 Oct 15;58(20):4611-5 [PMID: 9788612]
  8. Nature. 2013 Jul 11;499(7457):172-7 [PMID: 23846655]
  9. Nucleic Acids Res. 2010 Jan;38(Database issue):D105-10 [PMID: 19906716]
  10. Nucleic Acids Res. 2011 Aug;39(15):e98 [PMID: 21602262]
  11. Nat Protoc. 2014;9(6):1428-50 [PMID: 24853928]
  12. PLoS One. 2010 Jul 08;5(7):e11471 [PMID: 20628599]
  13. PLoS One. 2012;7(12):e49892 [PMID: 23284628]
  14. Oncogene. 2013 Oct 17;32(42):5111-22 [PMID: 23208501]
  15. Cell. 2009 Nov 13;139(4):693-706 [PMID: 19878981]
  16. Nucleic Acids Res. 2009 Jan;37(Database issue):D77-82 [PMID: 18842628]
  17. Nat Rev Cancer. 2010 Jan;10(1):65-76 [PMID: 20029425]
  18. Clin Cancer Res. 2001 Apr;7(4):818-23 [PMID: 11309328]
  19. Genome Res. 2013 Aug;23(8):1195-209 [PMID: 23595228]
  20. Nat Genet. 2006 Jun;38(6):626-35 [PMID: 16645617]
  21. Nature. 2012 Sep 6;489(7414):57-74 [PMID: 22955616]
  22. Cell. 2013 Jan 17;152(1-2):327-39 [PMID: 23332764]

Grants

  1. R01 GM103544/NIGMS NIH HHS
  2. R0-1 GM103544/NIGMS NIH HHS

MeSH Term

Binding Sites
Cell Line
Chromatin Immunoprecipitation
Computational Biology
High-Throughput Nucleotide Sequencing
Humans
Nucleotide Motifs
Position-Specific Scoring Matrices
Promoter Regions, Genetic
Protein Binding
Tamoxifen
Time Factors
Transcription Factors

Chemicals

Transcription Factors
Tamoxifen

Word Cloud

Created with Highcharts 10.0.0enrichmentanalysismotiftranscriptionChIP-seqmotifsfactorcomparativeexperimentsenrichedsetpaireddifferentialdatadifferentshowcellsnon-differentialconstrainedidentifyfactorsassayedconditionsregulatorymethodologyonesequencesidentifiesboundpromotersapproachesavailableregioneduBACKGROUND:MotifcanhelpcooperatecompetePreviouslylittleattentiongivenpairsbindingpotentiallydistinctpartners/competitorsstagesdevelopmentRESULTS:describenewidentifyingsequencedifferentiallyDNARNArelativeanotherapplyusingsingleknownkeyinvolvedtransformationnon-cancerousimmortalizedbreastMCF10A-ER-SrccancerstemwhereasalsosignificantlylocationswithinidentifieddiffersleveragespositionalpeakregionsgenesCONCLUSIONS:offersbiologicalinsightscontrastpreviousmethoddetectsenhancedweb-basedCentriMoalgorithmallowperformdescribedpaperCentriMo'son-lineinterfacehttp://memeebiauprovidesdozensdatabasesDNA-RNA-bindingfullrangeorganismsoutputfilespresentedhttp://researchimbuqau/tbailey/supplementary\_data/Lesluyes2014Differential

Similar Articles

Cited By