Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

DBTSS

General information

URL: http://dbtss.hgc.jp/
Full name: DataBase of Transcriptional Start Sites
Description: DBTSS is an integrative platform for transcriptome,epigenome and genome sequence variation data which represents exact positions of transcriptional start sites (TSSs) in the genome based on our unique experimental validated TSS sequencing method, TSS-seq.
Year founded: 2002
Last update: 2014-9-15
Version: v9.0
Accessibility:
Accessible
Country/Region: Japan

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: University of Tokyo
Address: Department of Medical Genome Sciences,Graduate School of Frontier Sciences,The University of Tokyo,Chiba,Japan
City: Chiba
Province/State:
Country/Region: Japan
Contact name (PI/Team): Yutaka Suzuki
Contact email (PI/Helpdesk): ysuzuki@k.u-tokyo.ac.jp

Publications

29126224
DBTSS/DBKERO for integrated analysis of transcriptional regulation. [PMID: 29126224]
Suzuki A, Kawano S, Mitsuyama T, Suyama M, Kanai Y, Shirahige K, Sasaki H, Tokunaga K, Tsuchihara K, Sugano S, Nakai K, Suzuki Y.

DBTSS (Database of Transcriptional Start Sites)/DBKERO (Database of Kashiwa Encyclopedia for human genome mutations in Regulatory regions and their Omics contexts) is the database originally initiated with the information of transcriptional start sites and their upstream transcriptional regulatory regions. In recent years, we updated the database to assist users to elucidate biological relevance of the human genome variations or somatic mutations in cancers which may affect the transcriptional regulation. In this update, we facilitate interpretations of disease associated genomic variation, using the Japanese population as a model case. We enriched the genomic variation dataset consisting of the 13,368 individuals collected for various genome-wide association studies and the reference epigenome information in the surrounding regions using a total of 455 epigenome datasets (four tissue types from 67 healthy individuals) collected for the International Human Epigenome Consortium (IHEC). The data directly obtained from the clinical samples was associated with that obtained from various model systems, such as the drug perturbation datasets using cultured cancer cells. Furthermore, we incorporated the results obtained using the newly developed analytical methods, Nanopore/10x Genomics long-read sequencing of the human genome and single cell analyses. The database is made publicly accessible at the URL (http://dbtss.hgc.jp/).

Nucleic Acids Res. 2018:46(D1) | 44 Citations (from Europe PMC, 2025-12-13)
25378318
DBTSS as an integrative platform for transcriptome, epigenome and genome sequence variation data. [PMID: 25378318]
Suzuki A, Wakaguri H, Yamashita R, Kawano S, Tsuchihara K, Sugano S, Suzuki Y, Nakai K.

DBTSS (http://dbtss.hgc.jp/) was originally constructed as a collection of uniquely determined transcriptional start sites (TSSs) in humans and some other species in 2002. Since then, it has been regularly updated and in recent updates epigenetic information has also been incorporated because such information is useful for characterizing the biological relevance of these TSSs/downstream genes. In the newest release, Release 9, we further integrated public and original single nucleotide variation (SNV) data into our database. For our original data, we generated SNV data from genomic analyses of various cancer types, including 97 lung adenocarcinomas and 57 lung small cell carcinomas from Japanese patients as well as 26 cell lines of lung cancer origin. In addition, we obtained publically available SNV data from other cancer types and germline variations in total of 11,322 individuals. With these updates, users can examine the association between sequence variation pattern in clinical lung cancers with its corresponding TSS-seq, RNA-seq, ChIP-seq and BS-seq data. Consequently, DBTSS is no longer a mere storage site for TSS information but has evolved into an integrative platform of a variety of genome activity data. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Nucleic Acids Res. 2015:43(Database issue) | 41 Citations (from Europe PMC, 2025-12-20)
22086958
DBTSS: DataBase of Transcriptional Start Sites progress report in 2012. [PMID: 22086958]
Yamashita R, Sugano S, Suzuki Y, Nakai K.

To support transcriptional regulation studies, we have constructed DBTSS (DataBase of Transcriptional Start Sites), which contains exact positions of transcriptional start sites (TSSs), determined with our own technique named TSS-seq, in the genomes of various species. In its latest version, DBTSS covers the data of the majority of human adult and embryonic tissues: it now contains 418 million TSS tag sequences from 28 tissues/cell cultures. Moreover, we integrated a series of our own transcriptomic data, such as the RNA-seq data of subcellular-fractionated RNAs as well as the ChIP-seq data of histone modifications and the binding of RNA polymerase II/several transcription factors in cultured cell lines into our original TSS information. We also included several external epigenomic data, such as the chromatin map of the ENCODE project. We further associated our TSS information with public or original single-nucleotide variation (SNV) data, in order to identify SNVs in the regulatory regions. These data can be browsed in our new viewer, which supports versatile search conditions of users. We believe that our new DBTSS will be an invaluable resource for interpreting the differential uses of TSSs and for identifying human genetic variations that are associated with disordered transcriptional regulation. DBTSS can be accessed at http://dbtss.hgc.jp.

Nucleic Acids Res. 2012:40(Database issue) | 38 Citations (from Europe PMC, 2025-12-20)
19910371
DBTSS provides a tissue specific dynamic view of Transcription Start Sites. [PMID: 19910371]
Yamashita R, Wakaguri H, Sugano S, Suzuki Y, Nakai K.

DataBase of Transcription Start Sites (DBTSS) is a database which contains precise positional information for transcription start sites (TSSs) of eukaryotic mRNAs. In this update, we included 330 million new tags generated by massively sequencing the 5'-end of oligo-cap selected cDNAs in humans and mice. The tags were collected from normal fetal or adult human tissues, including brain, thymus, liver, kidney and heart, from 6 human cell lines in 21 diverse growth conditions as well as from mouse NIH3T3 cell line: altogether 31 different cell types or culture conditions are represented. This unprecedented increase in depth of data now allows DBTSS to faithfully represent the dynamically changing landscape of TSSs in different cell types and conditions, during development and in the course of evolution. Differential usage of alternative 5'-ends across cell types and conditions can be viewed in a series of new interfaces. Promoter sequence information is now displayed in a comparative genomics viewer where evolutionary turnover of the TSSs can be evaluated. DBTSS can be accessed at http://dbtss.hgc.jp/.

Nucleic Acids Res. 2010:38(Database issue) | 51 Citations (from Europe PMC, 2025-12-20)
17942421
DBTSS: database of transcription start sites, progress report 2008. [PMID: 17942421]
Wakaguri H, Yamashita R, Suzuki Y, Sugano S, Nakai K.

DBTSS is a database of transcriptional start sites, based on our unique collection of precise, experimentally determined 5'-end sequences of full-length cDNAs. Since its first release in 2002, several major updates have been made. In this update, we expanded the human transcriptional start site dataset by 19 million uniquely mapped, and RefSeq-associated, 5'-end sequences, which were generated by a newly introduced Solexa sequencer. Moreover, in order to provide means for interpreting those massive TSS data, we implemented two new analytical tools: one for connecting expression information with predicted transcription factor binding sites; the other for examining evolutionary conservation or species-specificity of promoters and transcripts, which can be browsed by our own comparative genome viewer. With the expanded dataset and the enhanced functionalities, DBTSS provides a unique platform that enables in-depth transcriptome analyses. DBTSS is accessible at http://dbtss.hgc.jp/.

Nucleic Acids Res. 2008:36(Database issue) | 100 Citations (from Europe PMC, 2025-12-20)
16381981
DBTSS: DataBase of Human Transcription Start Sites, progress report 2006. [PMID: 16381981]
Yamashita R, Suzuki Y, Wakaguri H, Tsuritani K, Nakai K, Sugano S.

DBTSS was first constructed in 2002 based on precise, experimentally determined 5' end clones. Several major updates and additions have been made since the last report. First, the number of human clones has drastically increased, going from 190,964 to 1,359,000. Second, information about potential alternative promoters is presented because the number of 5' end clones is now sufficient to determine several promoters for one gene. Namely, we defined putative promoter groups by clustering transcription start sites (TSSs) separated by <500 bases. A total of 8308 human genes and 4276 mouse genes were found to have putative multiple promoters. Third, DBTSS provides detailed sequence comparisons of user-specified TSSs. Finally, we have added TSS information for zebrafish, malaria and schyzon (a red algae model organism). DBTSS is accessible at http://dbtss.hgc.jp.

Nucleic Acids Res. 2006:34(Database issue) | 76 Citations (from Europe PMC, 2025-12-20)
14681363
DBTSS, DataBase of Transcriptional Start Sites: progress report 2004. [PMID: 14681363]
Suzuki Y, Yamashita R, Sugano S, Nakai K.

DBTSS (http://dbtss.hgc.jp) was originally constructed based on a collection of experimentally determined TSSs of human genes. Since its first release in 2002, it has been updated several times. First, the amount of stored data has increased significantly: e.g. the number of clones that match both the RefSeq mRNA set and the genome sequence has increased from 111,382 to 190,964, now covering 1,234 genes. Second, the positions of SNPs in dbSNP were displayed on the upstream regions of contained human genes. Third, DBTSS now covers other species such as mouse and the human malaria parasite. It will become a central database containing data for many more species with oligo-capping and related methods. Lastly, the database now serves for comparative promoter analyses: in the current version, comparative views of potentially orthologous promoters from human and mouse are presented with an additional function of searching potential transcription-factor binding sites, which are either conserved or diverged between species.

Nucleic Acids Res. 2004:32(Database issue) | 101 Citations (from Europe PMC, 2025-12-20)
11752328
DBTSS: DataBase of human Transcriptional Start Sites and full-length cDNAs. [PMID: 11752328]
Suzuki Y, Yamashita R, Nakai K, Sugano S.

Although the information of cDNAs is indispensable for analyzing gene function, most of the cDNA sequences stored in current databases are imperfect in the sense that they lack the precise information of 5' end termini. To overcome this difficulty, we have developed the oligo-capping method to obtain full-length cDNAs, the information of which has been partly deposited in public databases. In this study, we further constructed human cDNA libraries enriched in clones containing the cap structure to systematically explore the 5' end structure of expressed genes. Of approximately 217 402 5' end sequences obtained, 111 382 have been matched to cDNA sequences of known genes (7889 genes) and are presented in our new database, DataBase of Transcriptional Start Sites (DBTSS; http://elmo.ims.u-tokyo.ac.jp/dbtss/). Sequence comparison between our entries and those of a reference sequence database, RefSeq, revealed that 4683 (34%) of RefSeq sequences should be extended towards the 5' ends. We also mapped each sequence on the human draft genome sequence to identify its transcriptional start site, which provides us with more detailed information on distribution patterns of transcriptional start sites and adjacent regulatory regions.

Nucleic Acids Res. 2002:30(1) | 156 Citations (from Europe PMC, 2025-12-20)

Ranking

All databases:
618/6895 (91.051%)
Gene genome and annotation:
222/2021 (89.065%)
Expression:
105/1347 (92.279%)
618
Total Rank
597
Citations
25.956
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2015-06-20
Curated by:
Lin Liu [2022-08-16]
Dong Zou [2018-03-05]
Yang Zhang [2018-01-27]
Jian Sang [2016-04-04]
Jian Sang [2015-12-06]
Jian Sang [2015-06-26]