BIG Search is a scalable text search engine built on Elasticsearch (a highly scalable open-source full-text search and analytics engine based on Apache Lucene). It supports cross-domain search, giving users access to a wide range of biomedical data, not only from NGDC databases but also from partner databases throughout the world.
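
As a rough illustration of what cross-domain search can look like in Elasticsearch, the sketch below builds a `_search` request that spans several indices with a single `multi_match` query. The index names (`gvm`, `biosample`, `lncbook`), the field names and the helper `build_cross_index_search` are hypothetical; nothing here describes how BIG Search actually lays out its indices. The snippet only constructs the request path and body and does not contact any server.

```python
import json


def build_cross_index_search(term, indices, fields, size=10):
    """Return (path, body) for an Elasticsearch _search request over several indices.

    Elasticsearch accepts a comma-separated list of index names in the URL path,
    which is one way to query multiple data sources in a single request.
    """
    if not indices:
        raise ValueError("at least one index name is required")
    path = "/" + ",".join(indices) + "/_search"
    body = {
        "size": size,  # maximum number of hits to return
        "query": {
            "multi_match": {
                "query": term,     # the user's search text
                "fields": fields,  # fields searched in every index
            }
        },
    }
    return path, body


path, body = build_cross_index_search(
    "BRCA1",
    indices=["gvm", "biosample", "lncbook"],  # hypothetical index names
    fields=["title", "description"],          # hypothetical field names
)
print("POST", path)
print(json.dumps(body, indent=2))
```

Keeping one index per source database, as assumed here, would let a single query fan out across all of them while each hit still reports the index it came from.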
42,664,458 records from 57 NGDC & Partner databases.
Database | Records Number | Description |
---|---|---|
GVM | 16,739,720 | Genome Variation Map |
BioSample | 8,173,301 | Biological Sample Library |
lncRNASNP2 | 4,443,771 | |
RMVar | 1,615,252 | RNA Modification associated variants database |
GSA | 1,449,508 | Genome Sequence Archive |
circAtlas | 610,406 | circAtlas 2.0 |
EWAS Data Hub | 597,253 | A data hub of DNA methylation array data and metadata |
LncBook | 409,204 | A curated knowledgebase of human long non-coding RNAs. |
EWAS Atlas | 262,089 | A knowledgebase of epigenome-wide association studies |
BBCancer | 137,210 | BBCancer: an expression atlas of blood-based biomarkers in the early diagnosis of cancers |
LncExpDB | 101,293 | Expression Database of Human Long non-coding RNAs |
BioProject | 67,449 | Biological Project Library |
Gene Expression Nebulas | 64,158 | A data portal of transcriptomic profiles across multiple species |
GenTree | 63,151 | GenTree, the time tree of genes along the evolutionary history |
MethBank 4.0 | 61,408 | A database of DNA methylation across a variety of species |
MethBank SRMs | 60,479 | MethBank, Single-base Resolution Methylomes (SRMs) |
MethBank CRMs | 60,415 | MethBank, Consensus Reference Methylomes (CRMs) |
SEGreg | 53,156 | Database of specifically expressed genes and regulation |
VCG | 43,801 | Virtual Chinese Genome Database is a dynamic genome database of the Chinese population. |
HGD | 42,901 | Homologous Gene Database |
CancerSEA | 34,227 | CancerSEA: a cancer single-cell state atlas |
EPSD | 30,679 | Eukaryotic Phosphorylation Site Database |
DEG | 28,458 | Database of Essential Genes |
lnCAR | 28,420 | lnCAR: A comprehensive resource for lncRNAs from Cancer Arrays |
dbPAF | 18,792 | database of Phospho-sites in Animals and Fungi |
AnimalTFDB | 8,266 | Animal Transcription Factor Database |
GWH | 7,757 | Genome Warehouse |
ZCURVE_CoVdb | 7,054 | Database of Essential Genes |
GSA for Human | 2,818 | Genome Sequence Archive for Human |
Database Commons | 840 | a curated catalogue of biological databases. |
BioCode | 641 | Archive Bioinformatics Codes for Open Source Projects |
PTMD | 594 | A database of human disease-associated post-translational modifications |
Brain Catalog | 517 | a One-Stop Shop for Brain-related Traits |
CellMarker | 467 | CellMarker: a manually curated resource of cell markers in human and mouse. |
RhesusBase Genes | 206 | |
ASCancer Atlas | 205 | A comprehensive knowledgebase of alternative splicing in human cancers |
EDK | 110 | Editome Disease Knowledgebase |
OMIX | 109 | OMIX |
NODE | 61 | The National Omics Data Encyclopedia |
eLMSG | 59 | An eLibrary of Microbial Systematics and Genomics |
iPCD | 54 | database of PCD regulators |
Cell Taxonomy | 39 | Cell Taxonomy is a curated repository of cell types with multifaceted characterization. |
iEKPD | 29 | Integrated annotations for Eukaryotic protein Kinases, protein Phosphatases & phosphoprotein-binding Domains |
ICG | 27 | internal control genes |
PLMD | 26 | Protein Lysine Modifications Database |
hTFtarget | 13 | hTFtarget collects comprehensive human TF ChIP-Seq data and applies a customized analysis workflow to identify reliable TF targets, taking epigenomic states into account |
CGGA | 11 | Chinese Glioma Genome Atlas |
CGDB | 6 | Circadian Gene Database |
TCOD | 4 | A multi-omics data platform for tropical crops |
VarClear | 3 | Gene Variation Interpretation Database |
KGCoV | 3 | KGCoV (Knowledge Graph of SARS-CoV-2) structures and matches COVID-19 epidemiological information and SARS-CoV-2 genomic data using combined curation methods, and integrates variation information generated by bioinformatic tools. |
DoriC | 2 | Database of Replication Origins |
iUUCD | 2 | integrated annotations for Ubiquitin and Ubiquitin-like Conjugation Database |
GTDB | 1 | Glycosyltransferases Database |
GWAS Atlas | 1 | GWAS Atlas is a curated resource of genome-wide variant-trait associations |
OpenLB | 4,557,859 | Open Library of Bioscience |
RCoV19 | 2,880,173 | Resource for Coronavirus 2019 |
Powered by EBISearch
Powered by NCBI Entrez
Powered by EBI AlphaFold DB