TeaPVs: a comprehensive genomic variation database for tea plant (Camellia sinensis).

Yanlin An, Xiaoqin Zhang, Sixia Jiang, Jingjing Zhao, Feng Zhang
Author Information
  1. Yanlin An: Department of Food Science and Engineering, Moutai Institute, Luban Street, Renhuai, 564502, Guizhou, People's Republic of China.
  2. Xiaoqin Zhang: Department of Food Science and Engineering, Moutai Institute, Luban Street, Renhuai, 564502, Guizhou, People's Republic of China.
  3. Sixia Jiang: Department of Food Science and Engineering, Moutai Institute, Luban Street, Renhuai, 564502, Guizhou, People's Republic of China.
  4. Jingjing Zhao: Department of Food Science and Engineering, Moutai Institute, Luban Street, Renhuai, 564502, Guizhou, People's Republic of China.
  5. Feng Zhang: Department of Food Science and Engineering, Moutai Institute, Luban Street, Renhuai, 564502, Guizhou, People's Republic of China. nkzhangfeng@163.com.

Abstract

Genome variation not only plays an important role in plant phenotypic modeling and adaptive evolution, but also enhances population genetic diversity and regulates gene expression. The tea plant (Camellia sinensis) has a large genome (~3.0 Gb), which makes the identification of genome-wide variants time-consuming and expensive. Although a large number of population sequencing datasets of different types continue to be published, there is no open platform that integrates these data and identifies variants in the tea plant genome. To integrate genetic variation information across tea plant populations, 238 whole-genome resequencing datasets, 213 transcriptome sequencing datasets, and data from 96 hybrid F1 individuals, totaling more than 20 Tb, were collected for variant site identification. Based on this variation information, we constructed TeaPVs, the first web service database of tea plant variation ( http://47.106.184.91:8025/ and http://liushang.top:8025/ ). It allows users to search all SNPs, Indels, SVs, and SSR/polymorphic SSR sequences by genomic location or gene ID. The website also provides gene expression search across different transcriptomes, sequence BLAST, and sequence extraction for CDS and mutation loci, among other functions. These features make TeaPVs a comprehensive bioinformatics platform for tea plant genetic variation, and it should also help reveal new functional mutations in the tea plant genome and support molecular marker-assisted breeding.

Grants

  1. 32160441/the National Natural Science Foundation of China

MeSH Term

Camellia sinensis
Plant Breeding
Genome, Plant
Tea
Genomics

Chemicals

Tea
