[GSA: Genome Sequence Archive].

Si Si Zhang, Ting Ting Chen, Jun Wei Zhu, Qing Zhou, Xu Chen, Yan Qing Wang, Wen Ming Zhao
Author Information
  1. Si Si Zhang: BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  2. Ting Ting Chen: BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  3. Jun Wei Zhu: BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  4. Qing Zhou: BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  5. Xu Chen: BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  6. Yan Qing Wang: BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  7. Wen Ming Zhao: BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.

Abstract

The Genome Sequence Archive (GSA), a new data repository for raw sequence reads in China, has been developed in compliance with the International Nucleotide Sequence Database Collaboration (INSDC) standards. It supports data generated from a variety of sequencing platforms ranging from Sanger sequencing to single-cell sequencing and provides data storing and sharing services freely for worldwide scientific communities. Since it went online in late 2015, GSA has archived more than 500 TB data and been acknowledged by many high-profile journals, including Cell, Nature, PNAS, GPB, etc. Focusing on omics data submission, storing and sharing typically for Chinese users, GSA promotes the initiative of the National Bioinformatics Center of China. This paper introduces the specifies of GSA as data collection, curation, management and exchange to facilitate users to understand and use GSA database.

MeSH Term

China
Computational Biology
Data Curation
Databases, Nucleic Acid
Genomics
High-Throughput Nucleotide Sequencing
Online Systems

Word Cloud

Created with Highcharts 10.0.0dataGSASequencesequencingGenomeChinastoringsharingusersArchivenewrepositoryrawsequencereadsdevelopedcomplianceInternationalNucleotideDatabaseCollaborationINSDCstandardssupportsgeneratedvarietyplatformsrangingSangersingle-cellprovidesservicesfreelyworldwidescientificcommunitiesSincewentonlinelate2015archived500TBacknowledgedmanyhigh-profilejournalsincludingCellNaturePNASGPBetcFocusingomicssubmissiontypicallyChinesepromotesinitiativeNationalBioinformaticsCenterpaperintroducesspecifiescollectioncurationmanagementexchangefacilitateunderstandusedatabase[GSA:Archive]

Similar Articles

Cited By (5)