Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

Repeat Sequence Database

General information

URL: http://rsdb.csie.ncu.edu.tw
Full name: Database of repetitive elements in complete genomes.
Description: Repeat Sequence Database, is first designed and implemented to store complete and comprehensive repetitive sequences. The database contains direct, inverted and palindromic repetitive sequences, and each repetitive sequence has a variable length ranging from seven to many hundred nucleotides.
Year founded: 2003
Last update:
Version:
Accessibility:
Unaccessible
Country/Region: China

Contact information

University/Institution: National Central University
Address: Department of Computer Science and Information Engineering, National Central University, Jung-li City 320, Taiwan, ROC.
City: Taiwan
Province/State: Taiwan
Country/Region: China
Contact name (PI/Team): Jorng-Tzong Horng
Contact email (PI/Helpdesk): horng@db.csie.ncu.edu.tw

Publications

12834164
Database of repetitive elements in complete genomes and data mining using transcription factor binding sites. [PMID: 12834164]
Horng JT, Lin FM, Lin JH, Huang HD, Liu BJ.

Approximately 43% of the human genome is occupied by repetitive elements. Even more, around 51% of the rice genome is occupied by repetitive elements. The analysis presented here indicates that repetitive elements in complete genomes may have been very important in the evolutionary genomics. In this study, a database, called the Repeat Sequence Database, is first designed and implemented to store complete and comprehensive repetitive sequences. See http://rsdb.csie.ncu.edu.tw for more information. The database contains direct, inverted and palindromic repetitive sequences, and each repetitive sequence has a variable length ranging from seven to many hundred nucleotides. The repetitive sequences in the database are explored using a mathematical algorithm to mine rules on how combinations of individual binding sites are distributed among repetitive sequences in the database. Combinations of transcription factor binding sites in the repetitive sequences are obtained and then data mining techniques are applied to mine association rules from these combinations. The discovered associations are further pruned to remove insignificant associations and obtain a set of associations. The mined association rules facilitate efforts to identify gene classes regulated by similar mechanisms and accurately predict regulatory elements. Experiments are performed on several genomes including C. elegans, human chromosome 22, and yeast.

IEEE Trans Inf Technol Biomed. 2003:7(2) | 7 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases:
6457/6895 (6.367%)
Raw bio-data:
517/582 (11.34%)
Gene genome and annotation:
1917/2021 (5.195%)
6457
Total Rank
7
Citations
0.318
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2018-02-09
Curated by:
Mengyu Pan [2018-09-21]
Qi Wang [2018-02-22]