Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

PlantTribes

General information

URL: http://fgp.huck.psu.edu/tribe.html
Full name:
Description: PlantTribes 2.0 is an objective classification system for plant proteins based on cluster analyses of the inferred proteomes.
Year founded: 2008
Last update: 2008-12-31
Version: 2.0
Accessibility:
Accessible
Country/Region: United States

Classification & Tag

Data type:
Data object:
Database category:
Major species:
NA
Keywords:

Contact information

University/Institution: Pennsylvania State University
Address: University Park, Pennsylvania 16802, USA
City: University Park
Province/State: Pennsylvania
Country/Region: United States
Contact name (PI/Team): Claude W. dePamphilis
Contact email (PI/Helpdesk): cwd3@psu.edu

Publications

18073194
PlantTribes: a gene and gene family resource for comparative genomics in plants. [PMID: 18073194]
Wall PK, Leebens-Mack J, Müller KF, Field D, Altman NS, dePamphilis CW.

The PlantTribes database (http://fgp.huck.psu.edu/tribe.html) is a plant gene family database based on the inferred proteomes of five sequenced plant species: Arabidopsis thaliana, Carica papaya, Medicago truncatula, Oryza sativa and Populus trichocarpa. We used the graph-based clustering algorithm MCL [Van Dongen (Technical Report INS-R0010 2000) and Enright et al. (Nucleic Acids Res. 2002; 30: 1575-1584)] to classify all of these species' protein-coding genes into putative gene families, called tribes, using three clustering stringencies (low, medium and high). For all tribes, we have generated protein and DNA alignments and maximum-likelihood phylogenetic trees. A parallel database of microarray experimental results is linked to the genes, which lets researchers identify groups of related genes and their expression patterns. Unified nomenclatures were developed, and tribes can be related to traditional gene families and conserved domain identifiers. SuperTribes, constructed through a second iteration of MCL clustering, connect distant, but potentially related gene clusters. The global classification of nearly 200 000 plant proteins was used as a scaffold for sorting approximately 4 million additional cDNA sequences from over 200 plant species. All data and analyses are accessible through a flexible interface allowing users to explore the classification, to place query sequences within the classification, and to download results for further study.

Nucleic Acids Res. 2008:36(Database issue) | 60 Citations (from Europe PMC, 2026-05-23)

Ranking

All databases:
2954/6932 (57.4%)
Gene genome and annotation:
919/2040 (55%)
2954
Total Rank
59
Citations
3.278
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2015-07-27
Curated by:
Lin Liu [2022-08-20]
Lina Ma [2018-06-13]
Shixiang Sun [2016-03-25]
Mengwei Li [2016-02-20]
Lina Ma [2015-12-21]
Shixiang Sun [2015-11-21]
Lina Ma [2015-11-10]