Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

PyloriGene

General information

URL: http://genolist.pasteur.fr/PyloriGene
Full name: The PyloriGene database
Description: Huge amounts of genomic information are currently being generated. Therefore, biologists require structured, exhaustive and comparative databases. The PyloriGene database was developed to respond to these needs, by integrating and connecting the information generated during the sequencing of two distinct strains of Helicobacter pylori.
Year founded: 2003
Last update: 2002
Version: 1.6
Accessibility:
Accessible
Country/Region: France

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: Pasteur Institute
Address: Unité de Pathogénie Bactérienne des Muqueuses, Institut Pasteur, Paris, France.
City: Paris
Province/State:
Country/Region: France
Contact name (PI/Team): Ivo G. Boneca
Contact email (PI/Helpdesk): bonecai@pasteur.fr

Publications

12626712
A revised annotation and comparative analysis of Helicobacter pylori genomes. [PMID: 12626712]
Boneca IG, de Reuse H, Epinat JC, Pupin M, Labigne A, Moszer I.

Huge amounts of genomic information are currently being generated. Therefore, biologists require structured, exhaustive and comparative databases. The PyloriGene database (http://genolist.pasteur.fr/PyloriGene) was developed to respond to these needs, by integrating and connecting the information generated during the sequencing of two distinct strains of Helicobacter pylori. This led to the need for a general annotation consensus, as the physical and functional annotations of the two strains differed significantly in some cases. A revised functional classification system was created to accommodate the existing data and to make it possible to classify coding sequences (CDS) into several functional categories to harmonize CDS classification. The annotation of the two complete genomes was revised in the light of new data, allowing us to reduce the percentage of hypothetical proteins from approximately 40 to 33%. This resulted in the reassignment of functions for 108 CDS (approximately 7% of all CDS). Interestingly, the functions of only approximately 13% of CDS (222 out of 1658 CDS) were annotated as a result of work done directly on H.pylori genes. Finally, comparison of the two published genomes revealed a significant amount of size variation between corresponding (orthologous) CDS. Most of these size variations were due to natural polymorphisms, although other sources of variation were identified, such as pseudogenes, new genes potentially regulated by slipped-strand mispairing mechanism, or frame-shifts. 113 of these differences were due to different start codon assignments, a common problem when constructing physical annotations.

Nucleic Acids Res. 2003:31(6) | 57 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases:
3592/6895 (47.919%)
Gene genome and annotation:
1113/2021 (44.978%)
3592
Total Rank
57
Citations
2.591
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2018-02-09
Curated by:
Mengyu Pan [2018-09-21]
Qi Wang [2018-02-22]