BIG Search

BIG Search is a scalable text search engine built based on ElasticSearch (a highly scalable open-source full-text search and analytics engine based on Apache Lucene). It features cross-domain search and facilitates users to gain access to a wide range of biomedical data, not only from NGDC databases but also partner databases throughout the world.

e.g., PRJCA000126;SAMC000385;tp53;EGFR; human; KaKs_Calculator

42,664,458 records from 57 NGDC & Partner databases.

Database Records Number Description
GVM 16,739,720 Genome Variation Map
BioSample 8,173,301 Biological Sample Library
lncRNASNP2 4,443,771
RMVar 1,615,252 RNA Modification associated variants database
GSA 1,449,508 Genome Sequence Archive
circAltas 610,406 circAtlas 2.0
EWAS Data Hub 597,253 A data hub of DNA methylation array data and metadata
LncBook 409,204 A curated knowledgebase of human long non-coding RNAs.
EWAS Atlas 262,089 A knowledgebase of epigenome-wide association studies
BBCancer 137,210 BBCancer: an expression atlas of blood-based biomarkers in the early diagnosis of cancers
LncExpDB 101,293 Expression Database of Human Long non-coding RNAs
BioProject 67,449 Biological Project Library
Gene Expression Nebulas 64,158 A data portal of transcriptomic profiles across multiple species
GenTree 63,151 GenTree, the time tree of genes along the evolutionary history
MethBank 4.0 61,408 a database of DNA methylation across a variety of species
MethBank SRMs 60,479 Methbank, Single-base Resolution Methylomes (SRMs)
Methbank CRMs 60,415 Methbank, Consensus Reference Methylomes (CRMs)
SEGreg 53,156 Database of specifically expressed genes and regulation
VCG 43,801 Virtual Chinese Genome Database is a dynamic genome database of Chinese population.
HGD 42,901 Homologous Gene Database
CancerSEA 34,227 CancerSEA: a cancer single-cell state atlas
EPSD 30,679 Eukaryotic Phosphorylation Site Database
DEG 28,458 Database of Essential Genes
lnCAR 28,420 lnCAR | A comprehensive resource for lncRNAs from Cancer Arrays
dbPAF 18,792 database of Phospho-sites in Animals and Fungi
AnimalTFDB 8,266 Animal Transcription Factor Database
GWH 7,757 Genome Warehouse
ZCURVE_CoVdb 7,054 Database of Essential Genes
GSA for Human 2,818 Genome Sequence Archive for Human
Database Commons 840 a curated catalogue of biological databases.
BioCode 641 Archive Bioinformatics Codes for Open Source Projects
PTMD 594 A database of human disease-associated post-translational modifications
Brain Catalog 517 a One-Stop Shop for Brain-related Traits
CellMarker 467 CellMarker: a manually curated resource of cell markers in human and mouse.
RhesusBase Genes 206
Ascancer Atlas 205 A comprehensive knowledgebase of alternative splicing in human cancers
EDK 110 Editome Disease Knowledgebase
NODE 61 The National Omics Data Encyclopedia
eLMSG 59 An eLibrary of Microbial Systematics and Genomics
iPCD 54 database of PCD regulators
Cell Taxonomy 39 Cell Taxonomy is a curated repository of cell types with multifaceted characterization.
iEKPD 29 Integrated annotations for Eukaryotic protein Kinases, protein Phosphatases & phosphoprotein-binding Domains
ICG 27 internal control genes
PLMD 26 Protein Lysine Modifications Database
hTFtarget 13 In this hTFtarget database, we collected comprehensive human TF ChIP-Seq data and customized an analysis workflow to identify reliable TF targets with taking epigenomic states into account
CGGA 11 Chinese Glioma Genome Atlas
CGDB 6 Circadian Gene Database
TCOD 4 A multi-omics data platform for tropical crops
VarClear 3 Gene Variation Interpretation Database
KGCoV 3 KGCoV(Knowledge Graph of SARS-CoV-2) structures and matches COVID-19 epidemiological information and SARS-CoV-2 genomic data with combined curation methods, and integrates variation information generated by bioinformatic tools.
DoriC 2 Database of Replication Origins
iUUCD 2 integrated annotations for Ubiquitin and Ubiquitin-like Conjugation Database
GTDB 1 Glycosyltransferases Database
GWAS Atlas 1 GWAS Atlas is a curated resource of genome-wide variant-trait associations
OpenLB 4,557,859 Open Library of Bioscience
RCoV19 2,880,173 Resource for Coronavirus 2019
Database Records Number Description

Powered by EBISearch

Database Records Number Description

Powered by NCBI Entrez

Database Records Number Description

Powered by EBI AlphaFold DB