Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

Diseasome

General information

URL: http://diseasome.kobic.re.kr/
Full name:
Description: This database is an integrated database of genes, genetic variation and diseases. This database provides a disease thesaurus with a tree view of that shows the number of genes that are associated with diseases, and a genome browser for conveniently looking up potentially deleterious SNPs among the genes that are strongly associated with specific diseases and clinical phenotypes. It also provides semi-automatic ways of deriving the list of candidate SNPs to be evaluated in epidemiological or molecular biological experiments for disease association studies. Currently, it contains 14,674 records on genetic variation and 109,715 records on genes related to human diseases.
Year founded: 2008
Last update:
Version:
Accessibility:
Accessible
Country/Region: Korea, Republic of

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: Korea Research Institute of Bioscience and Biotechnology
Address:
City:
Province/State:
Country/Region: Korea, Republic of
Contact name (PI/Team): Tae-Kwon Sohn
Contact email (PI/Helpdesk): loubert@naver.com

Publications

19091018
An integrated database-pipeline system for studying single nucleotide polymorphisms and diseases. [PMID: 19091018]
Yang JO, Hwang S, Oh J, Bhak J, Sohn TK.

BACKGROUND: Studies on the relationship between disease and genetic variations such as single nucleotide polymorphisms (SNPs) are important. Genetic variations can cause disease by influencing important biological regulation processes. Despite the needs for analyzing SNP and disease correlation, most existing databases provide information only on functional variants at specific locations on the genome, or deal with only a few genes associated with disease. There is no combined resource to widely support gene-, SNP-, and disease-related information, and to capture relationships among such data. Therefore, we developed an integrated database-pipeline system for studying SNPs and diseases.
RESULTS: To implement the pipeline system for the integrated database, we first unified complicated and redundant disease terms and gene names using the Unified Medical Language System (UMLS) for classification and noun modification, and the HUGO Gene Nomenclature Committee (HGNC) and NCBI gene databases. Next, we collected and integrated representative databases for three categories of information. For genes and proteins, we examined the NCBI mRNA, UniProt, UCSC Table Track and MitoDat databases. For genetic variants we used the dbSNP, JSNP, ALFRED, and HGVbase databases. For disease, we employed OMIM, GAD, and HGMD databases. The database-pipeline system provides a disease thesaurus, including genes and SNPs associated with disease. The search results for these categories are available on the web page http://diseasome.kobic.re.kr/, and a genome browser is also available to highlight findings, as well as to permit the convenient review of potentially deleterious SNPs among genes strongly associated with specific diseases and clinical phenotypes.
CONCLUSION: Our system is designed to capture the relationships between SNPs associated with disease and disease-causing genes. The integrated database-pipeline provides a list of candidate genes and SNP markers for evaluation in both epidemiological and molecular biological approaches to diseases-gene association studies. Furthermore, researchers then can decide semi-automatically the data set for association studies while considering the relationships between genetic variation and diseases. The database can also be economical for disease-association studies, as well as to facilitate an understanding of the processes which cause disease. Currently, the database contains 14,674 SNP records and 109,715 gene records associated with human diseases and it is updated at regular intervals.

BMC Bioinformatics. 2008:9 Suppl 12() | 12 Citations (from Europe PMC, 2025-03-29)

Ranking

All databases:
5006/6278 (20.277%)
Genotype phenotype and variation:
709/898 (21.158%)
Health and medicine:
1200/1501 (20.12%)
5006
Total Rank
12
Citations
0.75
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2018-01-26
Curated by:
Mengyu Pan [2018-09-21]
Mengyu Pan [2018-02-22]
Pei Wang [2018-01-26]