A chromosome-level genome assembly of Cape hare (Lepus capensis).

Xianggui Dong, Yu Liu, Yuan Chen, Xinxin Ping, Zhanjun Ren, Yuanyuan Zhang
Author Information
  1. Xianggui Dong: College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China. xgdong@nwafu.edu.cn.
  2. Yu Liu: College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
  3. Yuan Chen: College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China.
  4. Xinxin Ping: College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China.
  5. Zhanjun Ren: College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China.
  6. Yuanyuan Zhang: College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China. yyzhang@cau.edu.cn.

Abstract

The Cape hare (Lepus capensis) is among the most widely distributed hare species globally, inhabiting extensive regions across Africa, the Middle East, and Central Asia. However, evolutionary and genetic research on L. capensis has been seriously impeded by the absence of a reference genome. Here, we assembled a chromosome-level genome of L. capensis using PacBio HiFi sequencing and Hi-C assembly technology, with scaffolds anchored to 25 chromosomes, a total assembled length of 2.9 Gb, and a contig N50 of 124.44 Mb. BUSCO evaluation indicated that the genome assembly is 98.2% complete. De novo prediction revealed that repetitive sequences constitute 46.13% of the genome, with long interspersed nuclear elements (LINEs) making up the largest portion. We annotated a total of 13,868 protein-coding genes using transcriptomes from two tissues (muscle and skin). This high-quality reference genome serves as a valuable genomic resource for advancing genetic studies in this species.
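
For readers unfamiliar with the contig N50 statistic quoted above, the following is a minimal sketch of how it is conventionally computed: contigs are sorted from longest to shortest, and N50 is the length of the contig at which the running total first reaches half of the assembly length. The function name `n50` and the toy contig lengths are illustrative assumptions, not values or code from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembled length.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence to the shortest.
    for length in sorted(lengths, reverse=True):
        running += length
        # Stop at the first sequence that pushes coverage to >= 50%.
        if 2 * running >= total:
            return length


# Hypothetical toy contig lengths (arbitrary units), not data from the paper.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # prints 70
```

In this toy case the total length is 300, and the two longest contigs (80 + 70) already cover half of it, so the N50 is 70. A larger N50 relative to the total length indicates a more contiguous assembly.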

MeSH Term

Animals
Genome
Chromosomes
Hares
Long Interspersed Nucleotide Elements
Transcriptome
