Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

General information

URL: http://hocomoco.autosome.ru
Full name: Homo Sapiens Comprehensive Model Collection
Description: HOCOMOCO is a collection of human transcription factor binding sites models. Currently contains models for 680 human and 453 mouse TFs.
Year founded: 2013
Last update: 2018-01-04
Version: v11.0
Accessibility:
Manual:
Accessible
Real time : Checking...
Country/Region: Russian Federation

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: Russian Academy of Sciences
Address: Vavilov Street 32,Moscow 119991,GSP-1,Russia
City: Moscow
Province/State:
Country/Region: Russian Federation
Contact name (PI/Team): Vsevolod J. Makeev
Contact email (PI/Helpdesk): vsevolod.makeev@gmail.com

Publications

29140464
HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis. [PMID: 29140464]
Kulakovskiy IV, Vorontsov IE, Yevshin IS, Sharipov RN, Fedorova AD, Rumynskiy EI, Medvedeva YA, Magana-Mora A, Bajic VB, Papatsenko DA, Kolpakov FA, Makeev VJ.

We present a major update of the HOCOMOCO collection that consists of patterns describing DNA binding specificities for human and mouse transcription factors. In this release, we profited from a nearly doubled volume of published in vivo experiments on transcription factor (TF) binding to expand the repertoire of binding models, replace low-quality models previously based on in vitro data only and cover more than a hundred TFs with previously unknown binding specificities. This was achieved by systematic motif discovery from more than five thousand ChIP-Seq experiments uniformly processed within the BioUML framework with several ChIP-Seq peak calling tools and aggregated in the GTRD database. HOCOMOCO v11 contains binding models for 453 mouse and 680 human transcription factors and includes 1302 mononucleotide and 576 dinucleotide position weight matrices, which describe primary binding preferences of each transcription factor and reliable alternative binding specificities. An interactive interface and bulk downloads are available on the web: http://hocomoco.autosome.ru and http://www.cbrc.kaust.edu.sa/hocomoco11. In this release, we complement HOCOMOCO by MoLoTool (Motif Location Toolbox, http://molotool.autosome.ru) that applies HOCOMOCO models for visualization of binding sites in short DNA sequences.

Nucleic Acids Res. 2018:46(D1) | 349 Citations (from Europe PMC, 2024-04-06)
23175603
HOCOMOCO: a comprehensive collection of human transcription factor binding sites models. [PMID: 23175603]
Kulakovskiy IV, Medvedeva YA, Schaefer U, Kasianov AS, Vorontsov IE, Bajic VB, Makeev VJ.

Transcription factor (TF) binding site (TFBS) models are crucial for computational reconstruction of transcription regulatory networks. In existing repositories, a TF often has several models (also called binding profiles or motifs), obtained from different experimental data. Having a single TFBS model for a TF is more pragmatic for practical applications. We show that integration of TFBS data from various types of experiments into a single model typically results in the improved model quality probably due to partial correction of source specific technique bias. We present the Homo sapiens comprehensive model collection (HOCOMOCO, http://autosome.ru/HOCOMOCO/, http://cbrc.kaust.edu.sa/hocomoco/) containing carefully hand-curated TFBS models constructed by integration of binding sequences obtained by both low- and high-throughput methods. To construct position weight matrices to represent these TFBS models, we used ChIPMunk software in four computational modes, including newly developed periodic positional prior mode associated with DNA helix pitch. We selected only one TFBS model per TF, unless there was a clear experimental evidence for two rather distinct TFBS models. We assigned a quality rating to each model. HOCOMOCO contains 426 systematically curated TFBS models for 401 human TFs, where 172 models are based on more than one data source.

Nucleic Acids Res. 2013:41(Database issue) | 129 Citations (from Europe PMC, 2024-04-06)

Ranking

All databases:
261/6000 (95.667%)
Gene genome and annotation:
97/1675 (94.269%)
261
Total Rank
473
Citations
43
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2015-06-20
Curated by:
Amjad Ali [2019-10-26]
[2018-11-27]
Lina Ma [2018-06-11]
Lina Ma [2016-09-23]
Chunlei Yu [2016-04-17]
Chunlei Yu [2016-03-31]
Mengwei Li [2016-01-15]
Lin Liu [2016-01-03]
Chunlei Yu [2015-11-19]
Chunlei Yu [2015-06-28]