ProPan: a comprehensive database for profiling prokaryotic pan-genome dynamics.

Yadong Zhang, Hao Zhang, Zaichao Zhang, Qiheng Qian, Zhewen Zhang, Jingfa Xiao
Author Information
  1. Yadong Zhang: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China.
  2. Hao Zhang: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China.
  3. Zaichao Zhang: Department of Biology, The University of Western Ontario, London, Ontario N6A 5B7, Canada.
  4. Qiheng Qian: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China.
  5. Zhewen Zhang: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China.
  6. Jingfa Xiao: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China.

Abstract

Compared with conventional comparative genomics, recent pan-genomic studies have provided further insights into species' genomic dynamics, taxonomy and identification, pathogenicity, and environmental adaptation. To better understand the genome characteristics of species of interest and to fully mine key metabolic and resistance genes, together with their conservation and variation, we present ProPan (https://ngdc.cncb.ac.cn/propan), a public database covering 23 archaeal species and 1,481 bacterial species (51,882 strains in total) for comprehensively profiling prokaryotic pan-genome dynamics. By analyzing and integrating these large datasets, ProPan describes the pan-genome dynamics of a species of interest from three major aspects: 1) evaluation of species' characteristics and composition in pan-genome dynamics; 2) visualization of map association, functional annotation, and presence/absence variation for the gene clusters of all included species; 3) typical characteristics of environmental adaptation, including prediction of resistance genes for 126 substances (biocides, antimicrobial drugs, and metals) and evaluation of 31 metabolic cycle processes. In addition, ProPan provides a user-friendly interface, flexible retrieval, and multi-level real-time statistical visualization. Taken together, ProPan will serve as a valuable resource for studies of prokaryotic pan-genome dynamics, taxonomy and identification, and environmental adaptation.
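
As an illustration of what presence/absence variation of gene clusters means in practice, the following minimal Python sketch labels clusters in a toy matrix as core, accessory, or strain-specific. It is not ProPan's method or API: the function name `classify_clusters`, the cluster names, and the strict thresholds (core means present in every strain) are assumptions made for this example, and real analyses often use softer cutoffs.

```python
from typing import Dict, List


def classify_clusters(pav: Dict[str, List[int]]) -> Dict[str, str]:
    """Label each gene cluster by how many strains carry it.

    `pav` maps a cluster name to a row of 0/1 flags, one flag per strain.
    The category definitions are illustrative assumptions, not ProPan's.
    """
    labels: Dict[str, str] = {}
    for cluster, row in pav.items():
        n_strains = len(row)
        n_present = sum(1 for flag in row if flag)
        if n_present == 0:
            labels[cluster] = "absent"
        elif n_present == n_strains:
            labels[cluster] = "core"             # found in every strain
        elif n_present == 1:
            labels[cluster] = "strain-specific"  # found in exactly one strain
        else:
            labels[cluster] = "accessory"        # found in some, not all
    return labels


# Toy matrix: hypothetical clusters (rows) across four hypothetical strains.
toy_pav = {
    "cluster_A": [1, 1, 1, 1],
    "cluster_B": [1, 0, 1, 0],
    "cluster_C": [0, 0, 0, 1],
}
labels = classify_clusters(toy_pav)
```

In this sketch the pan-genome is the set of all clusters present in at least one strain, and the core genome is the subset labeled "core".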


MeSH Terms

Archaea
Bacteria
Genome
Genome, Bacterial
Genomics
Prokaryotic Cells
Databases, Genetic

Links to CNCB-NGDC Resources

Database Commons: DBC008430 (ProPan)

