Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

TSE

General information

URL: http://dswww02.pha.jhu.edu/TSEweb
Full name: The Terabase Search Engine
Description: a large-scale relational database of short-read sequences.
Year founded: 2019
Last update: 2016-11-22
Version:
Accessibility:
Accessible
Country/Region: United States

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: Johns Hopkins University
Address:
City:
Province/State:
Country/Region: United States
Contact name (PI/Team): Steven L Salzberg
Contact email (PI/Helpdesk):

Publications

30052772
The Terabase Search Engine: a large-scale relational database of short-read sequences. [PMID: 30052772]
Wilton R, Wheelan SJ, Szalay AS, Salzberg SL.

Motivation: DNA sequencing archives have grown to enormous scales in recent years, and thousands of human genomes have already been sequenced. The size of these data sets has made searching the raw read data infeasible without high-performance data-query technology. Additionally, it is challenging to search a repository of short-read data using relational logic and to apply that logic across samples from multiple whole-genome sequencing samples.
Results: We have built a compact, efficiently-indexed database that contains the raw read data for over 250 human genomes, encompassing trillions of bases of DNA, and that allows users to search these data in real time. The Terabase Search Engine enables retrieval from this database of all the reads for any genomic location in a matter of seconds. Users can search using a range of positions or a specific sequence that is aligned to the genome on the fly.
Availability: Public access to the Terabase Search Engine database is available at http://dswww02.pha.jhu.edu/TSEweb.
Supplementary information: Supplementary data are available at Bioinformatics online.

Bioinformatics. 2019:35(4) | 5 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases:
5603/6895 (18.753%)
Gene genome and annotation:
1700/2021 (15.933%)
5603
Total Rank
5
Citations
0.833
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2019-01-02
Curated by:
Dong Zou [2019-01-08]
Dong Zou [2019-01-02]