Database Commons

a catalog of worldwide biological databases

Database Profile

VirGen

General information

URL: http://bioinfo.ernet.in/virgen/virgen
Full name: viral genomes
Description: VirGen, a comprehensive viral genome resource that serves as an annotation and analysis pipeline, has been developed for the curation of public domain viral genome data.
Year founded: 2004
Last update:
Version:
Accessibility: Unaccessible
Country/Region: India

Classification & Tag

Data type:
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: Savitribai Phule Pune University
Address: Bioinformatics Centre, University of Pune, Pune 411 007 India.
City: Pune
Province/State:
Country/Region: India
Contact name (PI/Team): Kulkarni-Kale U
Contact email (PI/Helpdesk): urmila@bioinfo.ernet.in

Publications

Curation of viral genomes: challenges, applications and the way forward. [PMID: 17254296]
Kulkarni-Kale U, Bhosle SG, Manjari GS, Joshi M, Bansode S, Kolaskar AS.

BACKGROUND: Whole genome sequence data is a step towards generating the 'parts list' of life to understand the underlying principles of Biocomplexity. Genome sequencing initiatives of human and model organisms are targeted efforts towards understanding principles of evolution with an application envisaged to improve human health. These efforts culminated in the development of dedicated resources. In contrast, a large number of viral genomes have been sequenced by groups or individuals interested in studying antigenic variation amongst strains and species. These independent efforts enabled viruses to attain the status of 'best-represented taxa' with the highest number of genomes. However, due to a lack of concerted efforts, viral genomic sequences merely remained as entries in the public repositories until recently.
RESULTS: VirGen is a curated resource of viral genomes and their analyses. Since its first release, it has grown both in terms of coverage of viral families and development of new modules for annotation and analysis. The current release (2.0) includes data for twenty-five families with broad host range as against eight in the first release. The taxonomic description of viruses in VirGen is in accordance with the ICTV nomenclature. A well-characterised strain is identified as a 'representative entry' for every viral species. This non-redundant dataset is used for subsequent annotation and analyses using sequence-based Bioinformatics approaches. VirGen archives precomputed data on genome and proteome comparisons. A new data module that provides structures of viral proteins available in PDB has been incorporated recently. One of the unique features of VirGen is the prediction of conformational and sequential epitopes of known antigenic proteins using algorithms developed in-house, a step towards reverse vaccinology.
CONCLUSION: Structured organization of genomic data facilitates use of data mining tools, which provides opportunities for knowledge discovery. One of the approaches to achieve this goal is to carry out functional annotations using comparative genomics. VirGen, a comprehensive viral genome resource that serves as an annotation and analysis pipeline, has been developed for the curation of public domain viral genome data (http://bioinfo.ernet.in/virgen/virgen.html). Various steps in the curation and annotation of the genomic data and applications of the value-added derived data are substantiated with case studies.

BMC Bioinformatics. 2006:7 Suppl 5 | 3 Citations (from Europe PMC, 2026-02-28)
VirGen: a comprehensive viral genome resource. [PMID: 14681415]
Kulkarni-Kale U, Bhosle S, Manjari GS, Kolaskar AS.

VirGen is a comprehensive viral genome resource that organizes the 'sequence space' of viral genomes in a structured fashion. It has been developed with the objective of serving as an annotated and curated database comprising complete genome sequences of viruses, value-added derived data and data mining tools. The current release (v1.1) contains 559 complete genomes in addition to 287 putative genomes of viruses belonging to eight viral families for which the host range includes animals and plants. Viral genomes in VirGen are annotated using sequence-based Bioinformatics approaches. The genomic data is also curated to identify 'alternate names' of viral proteins, where available. VirGen archives the results of comparisons of genomes, proteomes and individual proteins within and between viral species. It is the first resource to provide phylogenetic trees of viral species computed using whole-genome sequence data. The module of predicted B-cell antigenic determinants in VirGen is an attempt to link the genome to its vaccinome. Comparative genome analysis data facilitate the study of genome organization and evolution of viruses, which would have implications in applied research to identify candidates for the design of vaccines and antiviral drugs. VirGen is a relational database and is available at http://bioinfo.ernet.in/virgen/virgen.html.

Nucleic Acids Res. 2004:32(Database issue) | 15 Citations (from Europe PMC, 2026-02-28)

Ranking

All databases: 5444/6932 (21.48%)
Raw bio-data: 444/587 (24.532%)
Gene genome and annotation: 1636/2039 (19.814%)
Structure: 757/972 (22.222%)
Total Rank: 5444
Citations: 18
z-index: 0.818
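
The record does not say how the ranking figures are derived. As a rough sketch only, the numbers listed here are consistent with two simple formulas, both of which are assumptions inferred from the values and not taken from the catalog: a percentage computed as (total - rank + 1) / total, and a z-index computed as total citations divided by the database age in years (2026, the year of the citation snapshot, minus the founding year 2004). The function names below are hypothetical.

```python
def rank_percent(rank: int, total: int, digits: int) -> float:
    """Share of databases ranked at or below this one (assumed formula)."""
    return round((total - rank + 1) / total * 100, digits)


def z_index(citations: int, year_founded: int, as_of_year: int) -> float:
    """Citations per year of database age (assumed formula)."""
    return round(citations / (as_of_year - year_founded), 3)


if __name__ == "__main__":
    # Values copied from this record; the rounding precision follows the record.
    print(rank_percent(5444, 6932, 2))   # All databases
    print(rank_percent(444, 587, 3))     # Raw bio-data
    print(rank_percent(1636, 2039, 3))   # Gene genome and annotation
    print(rank_percent(757, 972, 3))     # Structure
    print(z_index(18, 2004, 2026))       # 3 + 15 citations over 22 years
```

If the catalog uses a different definition, the agreement with these five values would be coincidental, so treat the formulas as a reading aid and not as the catalog's method.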

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation:
System accessibility & reliability:

Record metadata

Created on: 2018-01-27
Curated by:
Meiye Jiang [2018-02-24]
Qi Wang [2018-01-27]