A streamlined workflow for long-read DNA methylation analysis with NanoMethViz and Bioconductor.

Shian Su, Lucinda Xiao, James Lancaster, Tamara Cameron, Kelsey Breslin, Peter F Hickey, Marnie E Blewitt, Quentin Gouil, Matthew E Ritchie
Author Information
  1. Shian Su: Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia. ORCID
  2. Lucinda Xiao: Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia.
  3. James Lancaster: Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia.
  4. Tamara Cameron: Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia.
  5. Kelsey Breslin: Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia. ORCID
  6. Peter F Hickey: Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia. ORCID
  7. Marnie E Blewitt: Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia.
  8. Quentin Gouil: Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia.
  9. Matthew E Ritchie: Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia. ORCID

Abstract

Long-read sequencing technologies have transformed the field of epigenetics by enabling direct, single-base resolution detection of DNA modifications, such as methylation. This produces novel opportunities for studying the role of DNA methylation in gene regulation, imprinting, and disease. However, the unique characteristics of long-read data, including the modBAM format and extended read lengths, necessitate the development of specialised software tools for effective analysis. The NanoMethViz package provides a suite of tools for loading in long-read methylation data, visualising data at various data resolutions. It can convert the data for use with other Bioconductor software such as bsseq, DSS, dmrseq and edgeR to discover differentially methylated regions (DMRs). In this workflow article, we demonstrate the process of converting modBAM files into formats suitable for comprehensive downstream analysis. We leverage NanoMethViz to conduct an exploratory analysis, visually summarizing differences between samples, examining aggregate methylation profiles across gene and CpG islands, and investigating methylation patterns within specific regions at the single-read level. Additionally, we illustrate the use of dmrseq for identifying DMRs and show how to integrate these findings into gene-level visualization plots. Our analysis is applied to a triplicate dataset of haplotyped long-read methylation data from mouse neural stem cells, allowing us to visualize and compare the characteristics of the parental alleles on chromosome 7. By applying DMR analysis, we recover DMRs associated with known imprinted genes and visualise the methylation patterns of these genes summarised at single-read resolution. Through DMR analysis, we identify DMRs associated with known imprinted genes and visualize their methylation patterns at single-read resolution. This streamlined workflow is adaptable to common experimental designs and offers flexibility in the choice of upstream data sources and downstream statistical analysis tools.

Keywords

References

  1. Genome Res. 2021 Jul;31(7):1280-1289 [PMID: 34140313]
  2. Genome Res. 2024 Nov 20;34(11):1954-1965 [PMID: 39284686]
  3. F1000Res. 2017 Nov 28;6:2055 [PMID: 29333247]
  4. Genome Biol. 2012 Oct 03;13(10):R83 [PMID: 23034175]
  5. Proc Natl Acad Sci U S A. 1992 Mar 1;89(5):1827-31 [PMID: 1542678]
  6. Nat Commun. 2023 Jul 8;14(1):4054 [PMID: 37422489]
  7. PLoS Comput Biol. 2021 Oct 25;17(10):e1009524 [PMID: 34695109]
  8. Genome Biol. 2016 Oct 7;17(1):208 [PMID: 27717381]
  9. Bioinformatics. 2016 May 15;32(10):1446-53 [PMID: 26819470]
  10. Genome Biol. 2023 Apr 20;24(1):83 [PMID: 37081487]
  11. Nat Struct Mol Biol. 2010 Aug;17(8):948-54 [PMID: 20657585]
  12. Biostatistics. 2019 Jul 1;20(3):367-383 [PMID: 29481604]
  13. Nat Rev Genet. 2005 Aug;6(8):597-610 [PMID: 16136652]
  14. PLoS Comput Biol. 2013;9(8):e1003118 [PMID: 23950696]
  15. Genes Dev. 2011 May 15;25(10):1010-22 [PMID: 21576262]
  16. BMC Bioinformatics. 2020 Aug 5;21(1):343 [PMID: 32758139]
  17. Genes Cancer. 2011 Jun;2(6):607-17 [PMID: 21941617]
  18. Nucleic Acids Res. 2015 Apr 20;43(7):e47 [PMID: 25605792]
  19. Essays Biochem. 2019 Dec 20;63(6):639-648 [PMID: 31755932]
  20. Nature. 2021 Oct;598(7879):120-128 [PMID: 34616061]
  21. Nat Methods. 2015 Feb;12(2):115-21 [PMID: 25633503]
  22. Nat Methods. 2017 Apr;14(4):407-410 [PMID: 28218898]

MeSH Term

DNA Methylation
Animals
Workflow
Software
Mice
Sequence Analysis, DNA
CpG Islands

Word Cloud

Created with Highcharts 10.0.0methylationanalysisdataDNAlong-readDMRsresolutiontoolsNanoMethVizBioconductorworkflowpatternssingle-readgenesLong-readsequencinggenecharacteristicsmodBAMsoftwareusedmrseqregionsdownstreamvisualizeDMRassociatedknownimprintedstreamlinedtechnologiestransformedfieldepigeneticsenablingdirectsingle-basedetectionmodificationsproducesnovelopportunitiesstudyingroleregulationimprintingdiseaseHoweveruniqueincludingformatextendedreadlengthsnecessitatedevelopmentspecialisedeffectivepackageprovidessuiteloadingvisualisingvariousresolutionscanconvertbsseqDSSedgeRdiscoverdifferentiallymethylatedarticledemonstrateprocessconvertingfilesformatssuitablecomprehensiveleverageconductexploratoryvisuallysummarizingdifferencessamplesexaminingaggregateprofilesacrossCpGislandsinvestigatingwithinspecificlevelAdditionallyillustrateidentifyingshowintegratefindingsgene-levelvisualizationplotsappliedtriplicatedatasethaplotypedmouseneuralstemcellsallowinguscompareparentalalleleschromosome7applyingrecovervisualisesummarisedidentifyadaptablecommonexperimentaldesignsoffersflexibilitychoiceupstreamsourcesstatisticalDifferentialEpigenetics

Similar Articles

Cited By

No available data.