Database Commons: A Catalog of Worldwide Biological Databases.

Lina Ma, Dong Zou, Lin Liu, Huma Shireen, Amir A Abbasi, Alex Bateman, Jingfa Xiao, Wenming Zhao, Yiming Bao, Zhang Zhang
Author Information
  1. Lina Ma: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China. Electronic address: malina@big.ac.cn.
  2. Dong Zou: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China.
  3. Lin Liu: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China.
  4. Huma Shireen: National Center for Bioinformatics, Programme of Comparative and Evolutionary Genomics, Faculty of Biological Sciences, Quaid-i-Azam University, Islamabad 45320, Pakistan.
  5. Amir A Abbasi: National Center for Bioinformatics, Programme of Comparative and Evolutionary Genomics, Faculty of Biological Sciences, Quaid-i-Azam University, Islamabad 45320, Pakistan.
  6. Alex Bateman: European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, United Kingdom.
  7. Jingfa Xiao: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
  8. Wenming Zhao: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
  9. Yiming Bao: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
  10. Zhang Zhang: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China. Electronic address: zhangzhang@big.ac.cn.

Abstract

Biological databases serve as fundamental global infrastructure for the scientific community, dramatically aiding the transformation of big data into knowledge and driving significant innovations across a wide range of research fields. Given the rapid pace of data production, biological databases continue to increase in size and importance. To build a catalog of worldwide biological databases, we curate a total of 5825 biological databases from 8931 publications; these databases are geographically distributed across 72 countries/regions and developed by 1975 institutions (as of September 20, 2022). We further devise the z-index, a novel index that characterizes the scientific impact of a database, and rank all these biological databases, as well as their hosting institutions and countries, in terms of citation and z-index. Consequently, we present a series of statistics and trends for worldwide biological databases, yielding a global perspective on their status and impact in the life and health sciences. An up-to-date catalog of worldwide biological databases, together with their curated meta-information and derived statistics, is publicly available at Database Commons (https://ngdc.cncb.ac.cn/databasecommons/).

MeSH Terms

Databases, Factual
Big Data
