An approach of identifying differential nucleosome regions in multiple samples.

Lingjie Liu, Jianming Xie, Xiao Sun, Kun Luo, Zhaohui Steve Qin, Hongde Liu
Author Information
  1. Lingjie Liu: State Key Laboratory of Bioelectronics, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, China.
  2. Jianming Xie: State Key Laboratory of Bioelectronics, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, China.
  3. Xiao Sun: State Key Laboratory of Bioelectronics, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, China.
  4. Kun Luo: Department of Neurosurgery, Xinjiang Evidence-Based Medicine Research Institute, First Affiliated Hospital of Xinjiang Medical University, Urumqi, 830054, China.
  5. Zhaohui Steve Qin: Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA, 30322, USA.
  6. Hongde Liu: State Key Laboratory of Bioelectronics, School of Biological Science & Medical Engineering, Southeast University, Nanjing, 210096, China. liuhongde@seu.edu.cn.

Abstract

BACKGROUND: Nucleosomes play a role in transcriptional regulation by occluding the binding of proteins to DNA sites. Nucleosome occupancy varies among cell types, and identifying such variation helps in understanding regulatory mechanisms. Previous research has focused on methods for two-sample comparison. However, multiple-sample comparison (n ≥ 3) is necessary, especially in studies of development and cancer.
METHODS: Here, we propose a Chi-squared test-based approach, named Dimnp, to identify differential nucleosome regions (DNRs) in multiple samples. Dimnp is designed for sequencing read data and includes modules for both calling nucleosome occupancy and identifying DNRs.
RESULTS: We validated Dimnp on a dataset of Saccharomyces cerevisiae mutant strains in which the modifiable histone residues are mutated to alanine. Dimnp performs well (area under the curve > 0.87) when evaluated against manually identified DNRs. In a single run, Dimnp is able to identify all the DNRs identified by the two-sample method Danpos. Allowing a deviation of 40 bp, more than 60% of DNRs match between Dimnp and Danpos. With Dimnp, we found that promoters and telomeres are highly dynamic upon mutation of the modifiable histone residues.
CONCLUSIONS: We developed a tool for identifying DNRs in multiple samples and cell types. The tool can be applied to study nucleosome variation during gradual changes in development and cancer.
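
The abstract names a Chi-squared test as the basis for comparing three or more samples but does not give the formulation. As a rough illustration only, the sketch below applies a Chi-squared goodness-of-fit test to read counts in one genomic window, asking whether the counts are distributed across samples in proportion to sequencing depth. This is an assumed formulation, not Dimnp's actual procedure; the function names `chi2_sf` and `window_chi2`, the per-window read counts, and the depth-based normalization are all hypothetical.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of the chi-squared distribution (stdlib only)."""
    if x <= 0:
        return 1.0
    z = x / 2.0
    if df % 2 == 0:
        # Even df: finite sum exp(-z) * sum_{j < df/2} z^j / j!
        term, total = 1.0, 1.0
        for j in range(1, df // 2):
            term *= z / j
            total += term
        return math.exp(-z) * total
    # Odd df: erfc term plus a finite sum over half-integer powers of z.
    total = sum(
        z ** (j - 0.5) / math.gamma(j + 0.5)
        for j in range(1, (df - 1) // 2 + 1)
    )
    return math.erfc(math.sqrt(z)) + math.exp(-z) * total


def window_chi2(counts, library_sizes):
    """Test one window for depth-proportional read counts across samples.

    counts        -- reads observed in the window, one value per sample
    library_sizes -- total mapped reads per sample (assumed normalization)
    Returns (statistic, p_value) with len(counts) - 1 degrees of freedom.
    """
    total_reads = sum(counts)
    if total_reads == 0:
        # No reads in any sample: no evidence of a difference.
        return 0.0, 1.0
    total_depth = sum(library_sizes)
    expected = [total_reads * n / total_depth for n in library_sizes]
    stat = sum((o - e) ** 2 / e for o, e in zip(counts, expected))
    return stat, chi2_sf(stat, len(counts) - 1)


# Three samples with equal depth but very different coverage in the window.
stat, p_value = window_chi2([10, 50, 90], [1000, 1000, 1000])
print(stat, p_value)
```

A small p-value flags the window as a candidate differential region. In practice such a test would be run over many windows, so a multiple-testing correction would also be needed; that step is omitted here.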

MeSH Terms

Algorithms
Binding Sites
Chi-Square Distribution
Computational Biology
DNA
Datasets as Topic
Models, Statistical
Nucleosomes
ROC Curve
Reproducibility of Results
Saccharomyces cerevisiae

Chemicals

Nucleosomes
DNA
