Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

Factorbook

General information

URL: http://factorbook.org
Full name: Factorbook
Description: Factorbook contains results of the analysis about sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors.
Year founded: 2012
Last update:
Version:
Accessibility:
Accessible
Country/Region: United States

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: University of Massachusetts
Address: Program in Bioinformatics and Integrative Biology, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts 01605, USA
City: Worcester
Province/State: Massachusetts
Country/Region: United States
Contact name (PI/Team): Zhiping Weng
Contact email (PI/Helpdesk): Zhiping.Weng@umassmed.edu

Publications

23203885
Factorbook.org: a Wiki-based database for transcription factor-binding data generated by the ENCODE consortium. [PMID: 23203885]
Wang J, Zhuang J, Iyer S, Lin XY, Greven MC, Kim BH, Moore J, Pierce BG, Dong X, Virgil D, Birney E, Hung JH, Weng Z.

The Encyclopedia of DNA Elements (ENCODE) consortium aims to identify all functional elements in the human genome including transcripts, transcriptional regulatory regions, along with their chromatin states and DNA methylation patterns. The ENCODE project generates data utilizing a variety of techniques that can enrich for regulatory regions, such as chromatin immunoprecipitation (ChIP), micrococcal nuclease (MNase) digestion and DNase I digestion, followed by deeply sequencing the resulting DNA. As part of the ENCODE project, we have developed a Web-accessible repository accessible at http://factorbook.org. In Wiki format, factorbook is a transcription factor (TF)-centric repository of all ENCODE ChIP-seq datasets on TF-binding regions, as well as the rich analysis results of these data. In the first release, factorbook contains 457 ChIP-seq datasets on 119 TFs in a number of human cell lines, the average profiles of histone modifications and nucleosome positioning around the TF-binding regions, sequence motifs enriched in the regions and the distance and orientation preferences between motif sites.

Nucleic Acids Res. 2013:41(Database issue) | 243 Citations (from Europe PMC, 2025-12-20)
22955990
Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors. [PMID: 22955990]
Wang J, Zhuang J, Iyer S, Lin X, Whitfield TW, Greven MC, Pierce BG, Dong X, Kundaje A, Cheng Y, Rando OJ, Birney E, Myers RM, Noble WS, Snyder M, Weng Z.

Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has become the dominant technique for mapping transcription factor (TF) binding regions genome-wide. We performed an integrative analysis centered around 457 ChIP-seq data sets on 119 human TFs generated by the ENCODE Consortium. We identified highly enriched sequence motifs in most data sets, revealing new motifs and validating known ones. The motif sites (TF binding sites) are highly conserved evolutionarily and show distinct footprints upon DNase I digestion. We frequently detected secondary motifs in addition to the canonical motifs of the TFs, indicating tethered binding and cobinding between multiple TFs. We observed significant position and orientation preferences between many cobinding TFs. Genes specifically expressed in a cell line are often associated with a greater occurrence of nearby TF binding in that cell line. We observed cell-line-specific secondary motifs that mediate the binding of the histone deacetylase HDAC2 and the enhancer-binding protein EP300. TF binding sites are located in GC-rich, nucleosome-depleted, and DNase I sensitive regions, flanked by well-positioned nucleosomes, and many of these features show cell type specificity. The GC-richness may be beneficial for regulating TF binding because, when unoccupied by a TF, these regions are occupied by nucleosomes in vivo. We present the results of our analysis in a TF-centric web repository Factorbook (http://factorbook.org) and will continually update this repository as more ENCODE data are generated.

Genome Res. 2012:22(9) | 672 Citations (from Europe PMC, 2025-12-20)

Ranking

All databases:
253/6895 (96.345%)
Gene genome and annotation:
95/2021 (95.349%)
Structure:
25/967 (97.518%)
253
Total Rank
874
Citations
67.231
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2018-01-29
Curated by:
Fatima Batool [2018-09-04]
Fatima Batool [2018-08-11]
Pei Wang [2018-03-27]
Pei Wang [2018-03-08]
Pei Wang [2018-03-06]