The Updated Genome Warehouse: Enhancing Data Value, Security, and Usability to Address Data Expansion.

Yingke Ma, Xuetong Zhao, Yaokai Jia, Zhenxian Han, Caixia Yu, Zhuojing Fan, Zhang Zhang, Jingfa Xiao, Wenming Zhao, Yiming Bao, Meili Chen
Author Information
  1. Yingke Ma: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China. ORCID
  2. Xuetong Zhao: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China. ORCID
  3. Yaokai Jia: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China. ORCID
  4. Zhenxian Han: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China. ORCID
  5. Caixia Yu: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China. ORCID
  6. Zhuojing Fan: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China. ORCID
  7. Zhang Zhang: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China. ORCID
  8. Jingfa Xiao: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China. ORCID
  9. Wenming Zhao: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China. ORCID
  10. Yiming Bao: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China. ORCID
  11. Meili Chen: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China. ORCID

Abstract

The Genome Warehouse (GWH), accessible at https://ngdc.cncb.ac.cn/gwh, is an extensively-utilized public repository dedicated to the deposition, management, and sharing of genome assembly sequences, annotations, and metadata. This paper highlights noteworthy enhancements to the GWH since the 2021 version, emphasizing substantial advancements in web interfaces for data submission, database functionality updates, and resource integration. Key updates include the reannotation of released prokaryotic genomes, mirroring of genome resources from National Center for Biotechnology Information (NCBI) GenBank and Reference Sequence Database (RefSeq), integration of Poxviridae sequences, implementation of an online batch submission system, enhancements to the quality control system, advanced search capabilities, and the introduction of a controlled-access mechanism for human genome data. These improvements collectively enhance the ease and security of data submission and access as well as genome data value, thereby improving convenience and utility for researchers in the genomics field.

Keywords

References

Nucleic Acids Res. 2024 Jan 5;52(D1):D762-D769 [PMID: 37962425]
Genomics Proteomics Bioinformatics. 2021 Aug;19(4):584-589 [PMID: 34175476]
Nucleic Acids Res. 2022 Jan 7;50(D1):D785-D794 [PMID: 34520557]
Genomics Proteomics Bioinformatics. 2021 Aug;19(4):578-583 [PMID: 34400360]
Nucleic Acids Res. 2024 Jan 5;52(D1):D747-D755 [PMID: 37930867]
Nucleic Acids Res. 2016 Jan 4;44(D1):D733-45 [PMID: 26553804]
Nucleic Acids Res. 2021 Jan 8;49(D1):D121-D124 [PMID: 33166387]
Nucleic Acids Res. 2024 Jan 5;52(D1):D891-D899 [PMID: 37953337]
Nucleic Acids Res. 2023 Jan 6;51(D1):D141-D144 [PMID: 36350640]
Nucleic Acids Res. 2024 Jan 5;52(D1):D18-D32 [PMID: 38018256]
Genomics Proteomics Bioinformatics. 2023 Oct;21(5):900-903 [PMID: 37832784]
Nucleic Acids Res. 2024 Jan 5;52(D1):D33-D43 [PMID: 37994677]
Innovation (Camb). 2022 Aug 01;3(5):100296 [PMID: 36039088]

MeSH Term

Humans
Databases, Genetic
Genomics
Computer Security
Genome, Human

Links to CNCB-NGDC Resources

Database Commons: DBC006012 (GWH)

Word Cloud

Similar Articles

Cited By