KoNA



KoNA: Korean Nucleotide Archive as A New Data Repository for Nucleotide Sequence Data. [PMID: 38862433]

Gunhwan Ko, Jae Ho Lee, Young Mi Sim, Wangho Song, Byung-Ha Yoon, Iksu Byeon, Bang Hyuck Lee, Sang-Ok Kim, Jinhyuk Choi, Insoo Jang, Hyerin Kim, Jin Ok Yang, Kiwon Jang, Sora Kim, Jong-Hwan Kim, Jongbum Jeon, Jaeeun Jung, Seungwoo Hwang, Ji-Hwan Park, Pan-Gyu Kim, Seon-Young Kim, Byungwook Lee

Abstract

During the last decade, the generation and accumulation of petabase-scale high-throughput sequencing data have resulted in great challenges, including access to human data, as well as transfer, storage, and sharing of enormous amounts of data. To promote data-driven biological research, the Korean government announced that all biological data generated from government-funded research projects should be deposited at the Korea BioData Station (K-BDS), which consists of multiple databases for individual data types. Here, we introduce the Korean Nucleotide Archive (KoNA), a repository of nucleotide sequence data. As of July 2022, the Korean Read Archive in KoNA has collected over 477 TB of raw next-generation sequencing data from national genome projects. To ensure data quality and prepare for international alignment, a standard operating procedure was adopted, which is similar to that of the International Nucleotide Sequence Database Collaboration. The standard operating procedure includes quality control processes for submitted data and metadata using an automated pipeline, followed by manual examination. To ensure fast and stable data transfer, a high-speed transmission system called GBox is used in KoNA. Furthermore, the data uploaded to or downloaded from KoNA through GBox can be readily processed using a cloud computing service called Bio-Express. This seamless coupling of KoNA, GBox, and Bio-Express enhances the data experience, including submission, access, and analysis of raw nucleotide sequences. KoNA not only satisfies the unmet needs for a national sequence repository in Korea but also provides datasets to researchers globally and contributes to advances in genomics. The KoNA is available at https://www.kobic.re.kr/kona/.

Genomics Proteomics Bioinformatics. 2024:22(1) | 5 Citations (from Europe PMC, 2025-12-13)

URL:	https://www.kobic.re.kr/kona
Full name:	Korean Nucleotide Archive
Description:	The Korean Nucleotide Archive (KoNA) is a repository of nucleotide sequence data, established to support the storage, access, and analysis of raw next-generation sequencing (NGS) data. KoNA uses a high-speed transmission system called GBox for fast and stable data transfer, and a cloud computing service named Bio-Express for data processing.
Year founded:	2024
Last update:	2024-05-09
Version:	v1.0
Accessibility:	Accessible
Country/Region:	Korea, Republic of

Data type:	DNA, RNA
Data object:	Animal, Archaea, Bacteria, Fungi, Plant
Database category:	Gene genome and annotation, Genotype phenotype and variation, Metadata, Raw bio-data
Major species:	Homo sapiens, Arabidopsis thaliana, Saccharomyces cerevisiae, Escherichia coli, Methanocaldococcus jannaschii
Keywords:	next-generation sequencing, nucleotide sequence data, genomics, Korea BioData Station, cloud computing, multi-omics

University/Institution:	Korea Research Institute of Bioscience & Biotechnology
Country/Region:	Korea, Republic of
Contact name (PI/Team):	Gunhwan Ko
Contact email (PI/Helpdesk):	kona@kribb.re.kr

Database Commons
a catalog of worldwide biological databases
