The Updated Genome Warehouse: Enhancing Data Value, Security, and Usability to Address Data Expansion.

Yingke Ma, Xuetong Zhao, Yaokai Jia, Zhenxian Han, Caixia Yu, Zhuojing Fan, Zhang Zhang, Jingfa Xiao, Wenming Zhao, Yiming Bao, Meili Chen
Author Information
  1. Yingke Ma: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.
  2. Xuetong Zhao: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.
  3. Yaokai Jia: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.
  4. Zhenxian Han: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.
  5. Caixia Yu: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.
  6. Zhuojing Fan: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.
  7. Zhang Zhang: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.
  8. Jingfa Xiao: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.
  9. Wenming Zhao: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.
  10. Yiming Bao: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.
  11. Meili Chen: National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.

Abstract

The Genome Warehouse (GWH), accessible at https://ngdc.cncb.ac.cn/gwh, is a widely used public repository for the deposition, management, and sharing of genome assembly sequences, annotations, and metadata. This paper describes the major enhancements made to the GWH since the 2021 version, focusing on web interfaces for data submission, database functionality, and resource integration. Key updates include the reannotation of released prokaryotic genomes, mirroring of genome resources from the National Center for Biotechnology Information (NCBI) GenBank and Reference Sequence Database (RefSeq), integration of Poxviridae sequences, an online batch submission system, an enhanced quality control system, advanced search capabilities, and a controlled-access mechanism for human genome data. Together, these improvements make data submission and access easier and more secure and increase the value of the genome data, making the GWH more convenient and useful for researchers in genomics.
