BIG Search

BIG Search is a scalable text search engine built on Elasticsearch (a highly scalable open-source full-text search and analytics engine based on Apache Lucene). It supports cross-domain search, giving users access to a wide range of biomedical data from NGDC databases as well as partner databases around the world.

Example queries: PRJCA000126; SAMC000385; tp53; EGFR; human; KaKs_Calculator
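
As a rough illustration of how a free-text term such as the examples above could be turned into an Elasticsearch request, the sketch below builds a standard `multi_match` query body in Python. It is only a sketch under stated assumptions: the field names (`title`, `description`), the helper names `split_terms` and `build_query`, and the choice of `multi_match` are hypothetical and are not taken from BIG Search itself. It also assumes that the semicolons in the example line separate independent queries. No network call is made; the code only constructs the JSON body.

```python
import json


def split_terms(raw: str) -> list[str]:
    """Split a semicolon-separated line of example queries into clean terms."""
    return [term.strip() for term in raw.split(";") if term.strip()]


def build_query(term: str, fields=("title", "description")) -> dict:
    """Build an Elasticsearch query-DSL body for one free-text term.

    The field names are hypothetical placeholders, not BIG Search's schema.
    """
    return {
        "query": {
            "multi_match": {
                "query": term,
                "fields": list(fields),
            }
        }
    }


if __name__ == "__main__":
    examples = "PRJCA000126;SAMC000385;tp53;EGFR; human; KaKs_Calculator"
    for term in split_terms(examples):
        # Print the request body that would be sent to a search endpoint.
        print(json.dumps(build_query(term)))
```

A body like this would typically be posted to the `_search` endpoint of one or more indices; searching several indices in a single request is one way a cross-domain search could be arranged, although the source does not describe how BIG Search does it.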

97,272 records from 44 NGDC & Partner databases.

Database | Number of Records | Description
GSA | 3,914 | Genome Sequence Archive
BioProject | 1,047 | Biological Project Library
BioSample | 547 | Biological Sample Library
iEKPD | 323 | Integrated annotations for Eukaryotic protein Kinases, protein Phosphatases & phosphoprotein-binding Domains
RMVar | 257 | RNA Modification associated variants database
EKPD | 109 | Eukaryotic Kinase and Phosphatase Database
AnimalTFDB | 102 | Animal Transcription Factor Database
HGD | 34 | Homologous Gene Database
MethBank 4.0 | 17 | a database of DNA methylation across a variety of species
Gene Expression Nebulas | 17 | A data portal of transcriptomic profiles across multiple species
Ascancer Atlas | 14 | A comprehensive knowledgebase of alternative splicing in human cancers
NucMap | 14 | A database of genome-wide nucleosome positioning map across species.
GSA for Human | 12 | Genome Sequence Archive for Human
SEGreg | 11 | Database of specifically expressed genes and regulation
RhesusBase Genes | 9 |
NODE | 8 | The National Omics Data Encyclopedia
MethBank SRMs | 7 | Methbank, Single-base Resolution Methylomes (SRMs)
VCG | 7 | Virtual Chinese Genome Database is a dynamic genome database of Chinese population.
BioCode | 5 | Archive Bioinformatics Codes for Open Source Projects
DEG | 5 | Database of Essential Genes
LeukemiaDB | 4 | LeukemiaDB collects 3068 samples in 188 leukemia-associated RNA-seq datasets from NCBI GEO and SRA.
EPSD | 4 | Eukaryotic Phosphorylation Site Database
hTFtarget | 4 | In this hTFtarget database, we collected comprehensive human TF ChIP-Seq data and customized an analysis workflow to identify reliable TF targets with taking epigenomic states into account
Database Commons | 3 | a curated catalogue of biological databases.
dbPAF | 3 | database of Phospho-sites in Animals and Fungi
lnCAR | 3 | lnCAR: A comprehensive resource for lncRNAs from Cancer Arrays
BBCancer | 2 | BBCancer: an expression atlas of blood-based biomarkers in the early diagnosis of cancers
CancerSEA | 2 | CancerSEA: a cancer single-cell state atlas
Cell Taxonomy | 2 | Cell Taxonomy is a curated repository of cell types with multifaceted characterization.
GenTree | 2 | GenTree, the time tree of genes along the evolutionary history
GVM | 2 | Genome Variation Map
Platelets expression atlas | 2 | Platelet Expression Atlas (PEA) is a comprehensive expression resource and functional analysis platform for human platelets
lncRNASNP2 | 2 |
Methbank CRMs | 2 | Methbank, Consensus Reference Methylomes (CRMs)
PLMD | 2 | Protein Lysine Modifications Database
PTMD | 2 | A database of human disease-associated post-translational modifications
DiseaseEnhancer | 1 | DiseaseEnhancer: a resource of human disease-associated enhancer catalog.
EWAS Atlas | 1 | A knowledgebase of epigenome-wide association studies
EWAS Data Hub | 1 | A data hub of DNA methylation array data and metadata
EDK | 1 | Editome Disease Knowledgebase
miRNASNP-v3 | 1 | miRNASNP-v3 is a comprehensive database for SNPs and disease-related variations in miRNAs and miRNA targets
TWAS Atlas | 1 | Transcriptome-Wide Association Studies Atlas
OpenLB | 90,754 | Open Library of Bioscience

Powered by EBISearch

Powered by NCBI Entrez

Powered by EBI AlphaFold DB