Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

GenBase

General information

URL: https://ngdc.cncb.ac.cn/genbase/
Full name: Genetic Sequence Database
Description: GenBase is a genetic sequence database that accepts user submissions and integrates data from INSDC.
Year founded: 2022
Last update: 2024
Version: 1.0
Accessibility:
Accessible
Country/Region: China

Funding support

  • 2021YFF0703700

Classification & Tag

Data type:
Data object:
Database category:
Major species:
NA
Keywords:

Contact information

University/Institution: Beijing Institute of Genomics, Chinese Academy of Sciences
Address: No.1 Beichen West Road, Chaoyang District
City: Beijing
Province/State: Beijing
Country/Region: China
Contact name (PI/Team): Yiming Bao
Contact email (PI/Helpdesk): baoym@big.ac.cn

Publications

38913867
GenBase: A Nucleotide Sequence Database. [PMID: 38913867]
Bu C, Zheng X, Zhao X, Xu T, Bai X, Jia Y, Chen M, Hao L, Xiao J, Zhang Z, Zhao W, Tang B, Bao Y.

The rapid advancement of sequencing technologies poses challenges in managing the large volume and exponential growth of sequence data efficiently and on time. To address this issue, we present GenBase (https://ngdc.cncb.ac.cn/genbase), an open-access data repository that follows the International Nucleotide Sequence Database Collaboration (INSDC) data standards and structures, for efficient nucleotide sequence archiving, searching, and sharing. As a core resource within the National Genomics Data Center (NGDC), of the China National Center for Bioinformation (CNCB; https://ngdc.cncb.ac.cn), GenBase offers bilingual submission pipeline and services, as well as local submission assistance in China. GenBase also provides a unique Excel format for metadata description and feature annotation of nucleotide sequences, along with a real-time data validation system to streamline sequence submissions. As of April 23, 2024, GenBase received 68,251 nucleotide sequences and 689,574 annotated protein sequences across 414 species from 2319 submissions. Out of these, 63,614 (93%) nucleotide sequences and 620,640 (90%) annotated protein sequences have been released and are publicly accessible through GenBase's web search system, File Transfer Protocol (FTP), and Application Programming Interface (API). Additionally, in collaboration with INSDC, GenBase has constructed an effective data exchange mechanism with GenBank and started sharing released nucleotide sequences. Furthermore, GenBase integrates all sequences from GenBank with daily updates, demonstrating its commitment to actively contributing to global sequence data management and sharing.

Genomics Proteomics Bioinformatics. 2024:22(3) | 20 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases:
930/6895 (86.526%)
Gene genome and annotation:
309/2021 (84.76%)
930
Total Rank
16
Citations
16
z-index

Community reviews

0 Stars (1)
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2022-12-08
Curated by:
Dong Zou [2025-02-25]
Miaomiao Wang [2024-07-15]
Tianyi Xu [2023-01-06]
Dong Zou [2022-12-08]
Tianyi Xu [2022-12-08]