Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

CAGm

General information

URL: http://www.cagmdb.org
Full name: Germline Microsatellites
Description: a database of germline microsatellites from 2529 individuals in the 1000 genomes project.
Year founded: 2019
Last update: 2018-09-16
Version: v1.2
Accessibility:
Accessible
Country/Region: United States

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: Edward Via College of Osteopathic Medicine
Address: 2265 Kraft Drive, Blacksburg, VA 24060, USA.
City: Blacksburg
Province/State:
Country/Region: United States
Contact name (PI/Team): Nicholas Kinney
Contact email (PI/Helpdesk): nkinney06@gmail.com

Publications

30329086
CAGm: a repository of germline microsatellite variations in the 1000 genomes project. [PMID: 30329086]
Kinney N, Titus-Glover K, Wren JD, Varghese RT, Michalak P, Liao H, Anandakrishnan R, Pulenthiran A, Kang L, Garner HR.

The human genome harbors an abundance of repetitive DNA; however, its function continues to be debated. Microsatellites-a class of short tandem repeat-are established as an important source of genetic variation. Array length variants are common among microsatellites and affect gene expression; but, efforts to understand the role and diversity of microsatellite variation has been hampered by several challenges. Without adequate depth, both long-read and short-read sequencing may not detect the variants present in a sample; additionally, large sample sizes are needed to reveal the degree of population-level polymorphism. To address these challenges we present the Comparative Analysis of Germline Microsatellites (CAGm): a database of germline microsatellites from 2529 individuals in the 1000 genomes project. A key novelty of CAGm is the ability to aggregate microsatellite variation by population, ethnicity (super population) and gender. The database provides advanced searching for microsatellites embedded in genes and functional elements. All data can be downloaded as Microsoft Excel spreadsheets. Two use-case scenarios are presented to demonstrate its utility: a mononucleotide (A) microsatellite at the BAT-26 locus and a dinucleotide (CA) microsatellite in the coding region of FGFRL1. CAGm is freely available at http://www.cagmdb.org/.

Nucleic Acids Res. 2019:47(D1) | 7 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases:
5191/6895 (24.728%)
Genotype phenotype and variation:
737/1005 (26.766%)
5191
Total Rank
6
Citations
1
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2019-01-03
Curated by:
Dong Zou [2019-01-09]
Dong Zou [2019-01-03]