A chromosome-level reference genome of the wax gourd (Benincasa hispida).

Wenlong Luo, Jinqiang Yan, Shanwei Luo, Wenrui Liu, Dasen Xie, Biao Jiang
Author Information
  1. Wenlong Luo: Guangdong Key Laboratory for New Technology Research of Vegetables, Vegetable Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China.
  2. Jinqiang Yan: Guangdong Key Laboratory for New Technology Research of Vegetables, Vegetable Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China.
  3. Shanwei Luo: Guangdong Key Laboratory for New Technology Research of Vegetables, Vegetable Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China.
  4. Wenrui Liu: Guangdong Key Laboratory for New Technology Research of Vegetables, Vegetable Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China.
  5. Dasen Xie: Guangdong Key Laboratory for New Technology Research of Vegetables, Vegetable Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China.
  6. Biao Jiang: Guangdong Key Laboratory for New Technology Research of Vegetables, Vegetable Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640, China. jiangbiao@gdaas.cn.

Abstract

The wax gourd (Benincasa hispida), the only species in the genus Benincasa, is an important crop native to Asia that is widely grown for multiple uses. The first wax gourd draft genome was published three years ago, but it was incomplete and highly fragmented because of data and technical limitations. Here, we report a new chromosome-level genome assembly and annotation of B. hispida. Using a hybrid assembly strategy that combined PacBio long reads and Illumina short reads, we generated 974.87 Mb of unitigs with an N50 of 2.43 Mb. We then joined the unitigs into scaffolds with Hi-C data, resulting in 1,862 scaffolds with a total length of 975.62 Mb; 94.92% of that length (926.05 Mb) is contained in the 12 largest scaffolds, which correspond to the 12 chromosomes of B. hispida. We predicted 37,092 protein-coding genes, 85.05% of which were functionally annotated. This chromosome-level reference genome is a significant improvement over the earlier draft genome and should be a valuable resource for research and molecular breeding of the wax gourd.
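
The contiguity figures quoted in the abstract (N50 and the share of the assembly held in the largest scaffolds) follow from simple arithmetic on sequence lengths. The sketch below is illustrative only and is not the authors' pipeline: the function names `n50` and `top_fraction` and the toy length list are assumptions introduced here, and only the two reported totals (926.05 Mb and 975.62 Mb) come from the abstract.

```python
def n50(lengths):
    """Return the length L such that sequences of length >= L
    together cover at least half of the total assembly length."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("lengths must be non-empty")


def top_fraction(lengths, k):
    """Return the fraction of total length held by the k longest sequences."""
    ordered = sorted(lengths, reverse=True)
    return sum(ordered[:k]) / sum(ordered)


# Toy lengths (hypothetical, not from the paper).
toy = [50, 40, 30, 20, 10]
print(n50(toy))               # 40, because 50 + 40 = 90 covers half of 150
print(top_fraction(toy, 2))   # share of the total held by the 2 longest

# Check of the percentage reported in the abstract:
# 926.05 Mb in the 12 largest scaffolds out of 975.62 Mb in total.
anchored_mb = 926.05
total_mb = 975.62
print(f"{100 * anchored_mb / total_mb:.2f}%")  # 94.92%
```

With real data, the lengths would be read from the assembly FASTA or its index rather than typed in by hand.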

MeSH Term

Asia
Chromosomes
Cucurbitaceae
Phylogeny
Genome, Plant
