Database Commons

a catalog of worldwide biological databases

Database Profile

msRepDB

General information

URL: https://msrepdb.cbrc.kaust.edu.sa/pages/msRepDB/index
Full name: multi-species repeat database
Description: msRepDB integrates the detection results of LongRepMarker with existing databases and is presented by its authors as currently the most comprehensive multi-species repetitive sequence database. It takes the reference sequence or assembly of a species as input and generates the masked sequence and a comprehensive annotation report as output.
Year founded: 2022
Last update:
Version:
Accessibility: Inaccessible
Country/Region: Saudi Arabia

Classification & Tag

Data type: DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: King Abdullah University of Science and Technology
Address:
City:
Province/State:
Country/Region: Saudi Arabia
Contact name (PI/Team): Xin Gao
Contact email (PI/Helpdesk): xin.gao@kaust.edu.sa

Publications

msRepDB: a comprehensive repetitive sequence database of over 80 000 species. [PMID: 34850956]
Xingyu Liao, Kang Hu, Adil Salhi, You Zou, Jianxin Wang, Xin Gao

Repeats are prevalent in the genomes of all bacteria, plants and animals, and they cover nearly half of the Human genome, which play indispensable roles in the evolution, inheritance, variation and genomic instability, and serve as substrates for chromosomal rearrangements that include disease-causing deletions, inversions, and translocations. Comprehensive identification, classification and annotation of repeats in genomes can provide accurate and targeted solutions towards understanding and diagnosis of complex diseases, optimization of plant properties and development of new drugs. RepBase and Dfam are two most frequently used repeat databases, but they are not sufficiently complete. Due to the lack of a comprehensive repeat database of multiple species, the current research in this field is far from being satisfactory. LongRepMarker is a new framework developed recently by our group for comprehensive identification of genomic repeats. We here propose msRepDB based on LongRepMarker, which is currently the most comprehensive multi-species repeat database, covering >80 000 species. Comprehensive evaluations show that msRepDB contains more species, and more complete repeats and families than RepBase and Dfam databases. (https://msrepdb.cbrc.kaust.edu.sa/pages/msRepDB/index.html).

Nucleic Acids Res. 2022:50(D1) | 16 Citations (from Europe PMC, 2026-06-06)

Ranking

All databases: 2686/6932 (61.267%)
Gene genome and annotation: 833/2040 (59.216%)
Total Rank: 2686
Citations: 15
z-index: 3.75
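
The percentages shown next to each rank are not defined on this page. They are, however, consistent with a percentile of the form (N - rank + 1) / N, where N is the number of ranked databases. This formula is inferred from the figures above and is an assumption, not taken from Database Commons documentation. A minimal Python sketch under that assumption (the function name `rank_percentile` is hypothetical):

```python
def rank_percentile(rank: int, total: int) -> float:
    """Return the share (in percent) of entries ranked at this position or lower.

    Assumed formula, inferred from the listed figures and not confirmed
    by the source: (total - rank + 1) / total.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * (total - rank + 1) / total


# Rank and total taken from the "All databases" line.
print(round(rank_percentile(2686, 6932), 3))
# Rank and total taken from the "Gene genome and annotation" line.
print(round(rank_percentile(833, 2040), 3))
```

Under this reading, a higher percentage means a better rank: the top-ranked database would score 100%, and the last-ranked one would score 1/N.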

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation:
System accessibility & reliability:

Record metadata

Created on: 2022-04-21
Curated by:
Lina Ma [2022-06-25]
Jing Wei [2022-05-14]
Pei Liu [2022-04-21]