Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

WebSTR

General information

URL: https://webstr.ucsd.edu
Full name: WebSTR
Description: WebSTR is a comprehensive resource for genome-wide STR variation in humans, containing data for approximately 1.7 million unique STRs. It enables users to easily view summary statistics including population-specific allele frequencies, mutation rates, and phenotype associations for specific STRs of interest.
Year founded: 2023
Last update: 2023-10-15
Version: v1.0
Accessibility:
Accessible
Country/Region: Sweden

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: Stockholm University
Address:
City:
Province/State:
Country/Region: Sweden
Contact name (PI/Team): Melissa Gymrek
Contact email (PI/Helpdesk): mgymrek@ucsd.edu

Publications

37678708
WebSTR: A Population-wide Database of Short Tandem Repeat Variation in Humans. [PMID: 37678708]
Oxana Sachenkova Lundström, Max Adriaan Verbiest, Feifei Xia, Helyaneh Ziaei Jam, Inti Zlobec, Maria Anisimova, Melissa Gymrek

Short tandem repeats (STRs) are consecutive repetitions of one to six nucleotide motifs. They are hypervariable due to the high prevalence of repeat unit insertions or deletions primarily caused by polymerase slippage during replication. Genetic variation at STRs has been shown to influence a range of traits in humans, including gene expression, cancer risk, and autism. Until recently STRs have been poorly studied since they pose significant challenges to bioinformatics analyses. Moreover, genome-wide analysis of STR variation in population-scale cohorts requires large amounts of data and computational resources. However, the recent advent of genome-wide analysis tools has resulted in multiple large genome-wide datasets of STR variation spanning nearly two million genomic loci in thousands of individuals from diverse populations. Here we present WebSTR, a database of genetic variation and other characteristics of genome-wide STRs across human populations. WebSTR is based on reference panels of more than 1.7 million human STRs created with state of the art repeat annotation methods and can easily be extended to include additional cohorts or species. It currently contains data based on STR genotypes for individuals from the 1000 Genomes Project, H3Africa, the Genotype-Tissue Expression (GTEx) Project and colorectal cancer patients from the TCGA dataset. WebSTR is implemented as a relational database with programmatic access available through an API and a web portal for browsing data. The web portal is publicly available at https://webstr.ucsd.edu.

J Mol Biol. 2023:435(20) | 20 Citations (from Europe PMC, 2025-12-20)

Ranking

All databases:
1533/6895 (77.781%)
Gene genome and annotation:
500/2021 (75.309%)
Genotype phenotype and variation:
233/1005 (76.915%)
1533
Total Rank
18
Citations
9
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2024-07-16
Curated by:
Wenzhuo Cheng [2024-08-26]
Shiting Wang [2024-07-24]
shaosen zhang [2024-07-16]