BIG Search

BIG Search is a scalable text search engine built on Elasticsearch (a highly scalable open-source full-text search and analytics engine based on Apache Lucene). It supports cross-domain search, giving users access to a wide range of biomedical data from NGDC databases as well as partner databases throughout the world.
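
As a rough illustration of what a cross-domain query against Elasticsearch can look like, the sketch below builds the request path and body for a `multi_match` search over several indices at once. The index names and field names are hypothetical (the actual BIG Search schema is not described here), and nothing is sent over the network; the function only assembles the request.

```python
import json

# Hypothetical index and field names; the real BIG Search schema is not given in the source.
EXAMPLE_INDICES = ["gsa", "bioproject", "biosample"]
EXAMPLE_FIELDS = ["accession", "title", "description"]


def build_search_request(term, indices=EXAMPLE_INDICES, fields=EXAMPLE_FIELDS, size=10):
    """Return the (path, body) pair for one multi-index Elasticsearch search."""
    term = term.strip()
    if not term:
        raise ValueError("search term must not be empty")
    # Elasticsearch accepts a comma-separated list of indices in the request path.
    path = "/" + ",".join(indices) + "/_search"
    body = {
        "size": size,
        # multi_match runs the same query text against every listed field.
        "query": {"multi_match": {"query": term, "fields": list(fields)}},
    }
    return path, body


if __name__ == "__main__":
    path, body = build_search_request("tp53")
    print("POST", path)
    print(json.dumps(body, indent=2))
```

Keeping request construction separate from transport makes the query easy to inspect and test before it is sent with whichever HTTP or Elasticsearch client is in use.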

e.g., PRJCA000126; SAMC000385; tp53; EGFR; human; KaKs_Calculator

5,144 records from 35 NGDC & Partner databases.

Database                 Records  Description
GSA                        3,070  Genome Sequence Archive
BioProject                   807  Biological Project Library
BioSample                    590  Biological Sample Library
RMVar                        172  RNA Modification associated variants database
iEKPD                         91  Integrated annotations for Eukaryotic protein Kinases, protein Phosphatases & phosphoprotein-binding Domains
AnimalTFDB                    55  A comprehensive database including classification and annotation of genome-wide transcription factors
RhesusBase Genes              44
DEG                           36  Database of Essential Genes
DrLLPS                        36  Data resource of liquid-liquid phase separation
MethBank SRMs                 32  MethBank, Single-base Resolution Methylomes (SRMs)
Gene Expression Nebulas       30  A data portal of transcriptomic profiles across multiple species
vcg                           25  Virtual Chinese Genome Database, a dynamic genome database of the Chinese population
nucmap                        23  A database of genome-wide nucleosome positioning maps across species
MiCroKiTS                     22  Midbody, Centrosome, Kinetochore, Telomere and Spindle
EPSD                          21  Eukaryotic Phosphorylation Site Database
hTFtarget                     19  Comprehensive human TF ChIP-Seq data, with a customized analysis workflow to identify reliable TF targets taking epigenomic states into account
GenTree                       15  The time tree of genes along the evolutionary history
lnCAR                         10  A comprehensive resource for lncRNAs from Cancer Arrays
dbPAF                          7  Database of Phospho-sites in Animals and Fungi
BioCode                        5  Archive of Bioinformatics Codes for Open Source Projects
MethBank CRMs                  5  MethBank, Consensus Reference Methylomes (CRMs)
PLMD                           5  Protein Lysine Modifications Database
Database Commons               4  A curated catalogue of biological databases, providing easy access to a comprehensive collection of publicly available biological databases encompassing different data types and spanning diverse organisms
BBCancer                       4  An expression atlas of blood-based biomarkers in the early diagnosis of cancers
PTMD                           4  A database of human disease-associated post-translational modifications
GSA for Human                  3  Genome Sequence Archive for Human
CGGA                           1  Chinese Glioma Genome Atlas
DiseaseEnhancer                1  A catalog of human disease-associated enhancers
EWAS Atlas                     1  A knowledgebase of epigenome-wide association studies
EWAS Data Hub                  1  A data hub of DNA methylation array data and metadata
CancerSEA                      1  A cancer single-cell state atlas
CGDB                           1  Circadian Gene Database
EDK                            1  Editome Disease Knowledgebase
lncRNASNP2                     1
miRNASNP-v3                    1  A comprehensive database for SNPs and disease-related variations in miRNAs and miRNA targets
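
The headline count is simply the sum of the per-database record counts, with databases listed in descending order of hits. A minimal sketch of that bookkeeping follows, using only the first three rows of the table as sample input; the function names are illustrative and not part of BIG Search.

```python
def rank_databases(counts):
    """Sort (database, records) pairs by record count, largest first."""
    return sorted(counts.items(), key=lambda item: item[1], reverse=True)


def summarize(counts):
    """Build a headline such as '4,467 records from 3 databases.'"""
    total = sum(counts.values())
    # The ',' format specifier inserts thousands separators.
    return f"{total:,} records from {len(counts)} databases."


if __name__ == "__main__":
    # Sample input: the first three rows of the table above.
    sample = {"BioSample": 590, "GSA": 3070, "BioProject": 807}
    for name, records in rank_databases(sample):
        print(f"{name:<12}{records:>7,}")
    print(summarize(sample))
```

Applied to all 35 rows of the table, the same sum gives the 5,144 records reported in the summary line.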

Powered by EBISearch

Powered by NCBI Entrez

Powered by EBI AlphaFold DB