Accession PRJCA015080
Title MARS: The Master Database of All Possible RNA Sequences
Relevance Evolution
Data types fasta datafiles
Organisms root
Description The recent success of AlphaFold2 in protein structure prediction relied heavily on co-evolutionary information derived from homologous protein sequences found in a huge, integrated database of protein sequences (the Big Fantastic Database). In contrast, existing nucleotide databases have not been consolidated to facilitate wider and deeper homology searches. Here, we built a comprehensive database by combining the noncoding RNA sequences from RNAcentral, the transcriptome and metagenome assemblies from MG-RAST, the genomic sequences from Genome Warehouse (GWH), and the genomic sequences from MGnify with NCBI's nucleotide database (nt) and its subsets.
Sample scope no targets
Release date 2023-02-20
Publication
PubMed ID Article title Journal name DOI Year
10.1101/2023.02.01.526559
MARS and RNAcmap3: The Master Database of All Possible RNA Sequences Integrated with RNAcmap for RNA Homology Search Genomics, Proteomics & Bioinformatics 10.1093/gpbjnl/qzae018 2024
Grants
Agency program Grant ID Grant title
No funding support
Submitter Ke Chen (chenchk012@163.com)
Organization Shenzhen Bay Laboratory
Submission date 2023-02-20

Project Data

Resource name Description