SorGSD: updating and expanding the sorghum genome science database with new contents and tools.

Yuanming Liu, Zhonghuang Wang, Xiaoyuan Wu, Junwei Zhu, Hong Luo, Dongmei Tian, Cuiping Li, Jingchu Luo, Wenming Zhao, Huaiqing Hao, Hai-Chun Jing
Author Information
  1. Yuanming Liu: Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China.
  2. Zhonghuang Wang: University of Chinese Academy of Sciences, Beijing, 100049, China.
  3. Xiaoyuan Wu: Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China.
  4. Junwei Zhu: China National Center for Bioinformation, Beijing, 100101, China.
  5. Hong Luo: Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China.
  6. Dongmei Tian: China National Center for Bioinformation, Beijing, 100101, China.
  7. Cuiping Li: China National Center for Bioinformation, Beijing, 100101, China.
  8. Jingchu Luo: College of Life Sciences and Center for Bioinformatics, Peking University, Beijing, 100871, China.
  9. Wenming Zhao: University of Chinese Academy of Sciences, Beijing, 100049, China. zhaowm@big.ac.cn.
  10. Huaiqing Hao: Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China. hqhao@ibcas.ac.cn.
  11. Hai-Chun Jing: Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China.

Abstract

BACKGROUND: As the fifth major cereal crop, originating from Africa, sorghum (Sorghum bicolor) has become a key C4 model organism for energy plant research. With the development of high-throughput detection technologies for various omics data, a large amount of multi-dimensional, multi-omics information has accumulated for sorghum. Integrating this information may accelerate genetic research and improve molecular breeding for sorghum agronomic traits.
RESULTS: We updated the Sorghum Genome SNP Database (SorGSD) with new data and new features and renamed it the Sorghum Genome Science Database (SorGSD). The original version contained SNPs from 48 sorghum accessions mapped to the reference genome BTx623 (v2.1). The new version covers 289 sorghum lines with both single nucleotide polymorphisms (SNPs) and small insertions/deletions (INDELs), aligned to the newly assembled and annotated sorghum genome BTx623 (v3.1). It also provides phenotypic data and panicle pictures of critical accessions. We implemented new analysis tools, including ID Conversion, Homologue Search and Genome Browser, and updated the general information related to sorghum research, such as online sorghum resources and literature references. In addition, we deployed a new database infrastructure and redesigned the user interface as one of the Genome Variation Map databases. The new version of SorGSD is freely accessible online at http://ngdc.cncb.ac.cn/sorgsd/ .
CONCLUSIONS: SorGSD integrates large-scale genomic variation and phenotypic information and incorporates online tools for data mining, genome navigation and analysis. We hope that SorGSD will provide a valuable resource for sorghum researchers to find variations of interest and generate customized high-throughput datasets for further analysis.

Grants

  1. 2018YFD1000701/Ministry of Science and Technology of the People's Republic of China
  2. 32072026/National Natural Science Foundation of China

Links to CNCB-NGDC Resources

Database Commons: DBC001792 (SorGSD)