Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

GMGD

General information

URL: https://db.cngb.org/pop/gmgd
Full name: Guizhou Multi-ethnic Genome Database
Description: The GMGD database comprises more than 16.33 million variants in GRCh38 and 16.20 million variants in GRCh37.
Year founded: 2023
Last update: 2024-06-17
Version: v1.0
Accessibility:
Accessible
Country/Region: China

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: Guizhou Medical University
Address: Guiyang, 550004, Guizhou, China.
City:
Province/State:
Country/Region: China
Contact name (PI/Team): Xiaolan Qi
Contact email (PI/Helpdesk): xlq@gmc.edu.cn

Publications

38886537
A comprehensive whole genome database of ethnic minority populations. [PMID: 38886537]
Yan He, Changgui Lei, Chanjuan Wan, Shuang Zeng, Ting Zhang, Fei Luo, Ruichao Li, Xiaokun Li, Anshu Zhao, Defu Xiao, Yunyan Luo, Keren Shan, Xiaolan Qi, Xin Jin

China, is characterized by its remarkable ethnical diversity, which necessitates whole genome variation data from multiple populations as crucial tools for advancing population genetics and precision medical research. However, there has been a scarcity of research concentrating on the whole genome of ethnic minority groups. To fill this gap, we developed the Guizhou Multi-ethnic Genome Database (GMGD). It comprises whole genome sequencing data from 476 healthy unrelated individuals spanning 11 ethnic minorities groups in Guizhou Province, Southwest China, including Bouyei, Dong, Miao, Yi, Bai, Gelo, Zhuang, Tujia, Yao, Hui, and Sui. The GMGD database comprises more than 16.33 million variants in GRCh38 and 16.20 million variants in GRCh37. Among these, approximately 11.9% (1,956,322) of the variants in GRCh38 and 18.5% (3,009,431) of the variants in GRCh37 are entirely new and do not exist in the dbSNP database. These novel variants shed light on the genetic diversity landscape across these populations, providing valuable insights with an average coverage of 5.5 ×. This makes GMGD the largest genome-wide database encompassing the most diverse ethnic groups to date. The GMGD interactive interface facilitates researchers with multi-dimensional mutation search methods and displays population frequency differences among global populations. Furthermore, GMGD is equipped with a genotype-imputation function, enabling enhanced capabilities for low-depth genomic research or targeted region capture studies. GMGD offers unique insights into the genomic variation landscape of different ethnic groups, which are freely accessible at https://db.cngb.org/pop/gmgd/ .

Sci Rep. 2024:14(1) | 2 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases:
4125/6895 (40.189%)
Gene genome and annotation:
1273/2021 (37.061%)
Genotype phenotype and variation:
594/1005 (40.995%)
4125
Total Rank
2
Citations
2
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2024-07-16
Curated by:
Wenzhuo Cheng [2024-08-28]
Wenzhuo Cheng [2024-07-24]
Miaomiao Wang [2024-07-16]