Database Commons

a catalog of worldwide biological databases

Database Profile

GTDB

General information

URL: https://gtdb.ecogenomic.org/
Full name: The Genome Taxonomy Database
Description: GTDB is an initiative to establish a standardised microbial taxonomy based on genome phylogeny.
Year founded: 2018
Last update: 2019-06-19
Version: Release 04-RS89
Accessibility: Accessible
Country/Region: Australia

Contact information

University/Institution: University of Queensland
Address: Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of Queensland, Queensland, Australia.
City: Brisbane
Province/State: Queensland
Country/Region: Australia
Contact name (PI/Team): Philip Hugenholtz
Contact email (PI/Helpdesk): p.hugenholtz@uq.edu.au

Publications

GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy. [PMID: 34520557]
Parks DH, Chuvochina M, Rinke C, Mussig AJ, Chaumeil PA, Hugenholtz P.

The Genome Taxonomy Database (GTDB; https://gtdb.ecogenomic.org) provides a phylogenetically consistent and rank normalized genome-based taxonomy for prokaryotic genomes sourced from the NCBI Assembly database. GTDB R06-RS202 spans 254 090 bacterial and 4316 archaeal genomes, a 270% increase since the introduction of the GTDB in November, 2017. These genomes are organized into 45 555 bacterial and 2339 archaeal species clusters which is a 200% increase since the integration of species clusters into the GTDB in June, 2019. Here, we explore prokaryotic diversity from the perspective of the GTDB and highlight the importance of metagenome-assembled genomes in expanding available genomic representation. We also discuss improvements to the GTDB website which allow tracking of taxonomic changes, easy assessment of genome assembly quality, and identification of genomes assembled from type material or used as species representatives. Methodological updates and policy changes made since the inception of the GTDB are then described along with the procedure used to update species clusters in the GTDB. We conclude with a discussion on the use of average nucleotide identities as a pragmatic approach for delineating prokaryotic species.

Nucleic Acids Res. 2021:() | 1406 Citations (from Europe PMC, 2025-12-13)
GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. [PMID: 31730192]
Chaumeil PA, Mussig AJ, Hugenholtz P, Parks DH.

SUMMARY: The GTDB Toolkit (GTDB-Tk) provides objective taxonomic assignments for bacterial and archaeal genomes based on the Genome Taxonomy Database (GTDB). GTDB-Tk is computationally efficient and able to classify thousands of draft genomes in parallel. Here we demonstrate the accuracy of the GTDB-Tk taxonomic assignments by evaluating its performance on a phylogenetically diverse set of 10,156 bacterial and archaeal metagenome-assembled genomes. AVAILABILITY: GTDB-Tk is implemented in Python and licensed under the GNU General Public License v3.0. Source code and documentation are available at: https://github.com/ecogenomics/gtdbtk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Bioinformatics. 2019:() | 3245 Citations (from Europe PMC, 2025-12-13)
A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. [PMID: 30148503]
Parks DH, Chuvochina M, Waite DW, Rinke C, Skarshewski A, Chaumeil PA, Hugenholtz P.

Taxonomy is an organizing principle of biology and is ideally based on evolutionary relationships among organisms. Development of a robust bacterial taxonomy has been hindered by an inability to obtain most bacteria in pure culture and, to a lesser extent, by the historical use of phenotypes to guide classification. Culture-independent sequencing technologies have matured sufficiently that a comprehensive genome-based taxonomy is now possible. We used a concatenated protein phylogeny as the basis for a bacterial taxonomy that conservatively removes polyphyletic groups and normalizes taxonomic ranks on the basis of relative evolutionary divergence. Under this approach, 58% of the 94,759 genomes comprising the Genome Taxonomy Database had changes to their existing taxonomy. This result includes the description of 99 phyla, including six major monophyletic units from the subdivision of the Proteobacteria, and amalgamation of the Candidate Phyla Radiation into a single phylum. Our taxonomy should enable improved classification of uncultured bacteria and provide a sound basis for ecological and evolutionary studies.

Nat Biotechnol. 2018:36(10) | 2277 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases:
15/6895 (99.797%)
Phylogeny and homology:
1/302 (100%)
Standard ontology and nomenclature:
3/238 (99.16%)
Total Rank: 15
Citations: 6,426
z-index: 918
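
The percentages shown beside each rank appear consistent with a simple percentile: the share of databases in the category that sit at or below the given position. The sketch below is only an assumption inferred from the three figures listed here, not a documented Database Commons formula; the function name `rank_percentile` is hypothetical.

```python
def rank_percentile(rank: int, total: int) -> float:
    """Share (in percent) of databases ranked at or below `rank`.

    Assumed formula, inferred from the displayed figures:
    (1 - (rank - 1) / total) * 100
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return (1 - (rank - 1) / total) * 100


# Rank/total pairs taken from the Ranking section of this profile.
rankings = [
    ("All databases", 15, 6895),
    ("Phylogeny and homology", 1, 302),
    ("Standard ontology and nomenclature", 3, 238),
]

for category, rank, total in rankings:
    print(f"{category}: {rank}/{total} ({rank_percentile(rank, total):.3f}%)")
```

Under this assumption, a rank of 1 always maps to 100%, and the percentile falls as the rank number grows relative to the category size.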

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation:
System accessibility & reliability:


Record metadata

Created on: 2019-10-27
Curated by:
Dong Zou [2021-10-19]
Lina Ma [2019-11-28]
Amjad Ali [2019-10-27]