Cell Taxonomy: a curated repository of cell types with multifaceted characterization.

Shuai Jiang, Qiheng Qian, Tongtong Zhu, Wenting Zong, Yunfei Shang, Tong Jin, Yuansheng Zhang, Ming Chen, Zishan Wu, Yuan Chu, Rongqin Zhang, Sicheng Luo, Wei Jing, Dong Zou, Yiming Bao, Jingfa Xiao, Zhang Zhang
Author Information
  1. Shuai Jiang: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China. ORCID
  2. Qiheng Qian: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  3. Tongtong Zhu: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  4. Wenting Zong: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  5. Yunfei Shang: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  6. Tong Jin: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  7. Yuansheng Zhang: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  8. Ming Chen: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  9. Zishan Wu: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  10. Yuan Chu: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  11. Rongqin Zhang: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  12. Sicheng Luo: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  13. Wei Jing: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  14. Dong Zou: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  15. Yiming Bao: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
  16. Jingfa Xiao: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China. ORCID
  17. Zhang Zhang: National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China. ORCID

Abstract

Single-cell studies have delineated cellular diversity and uncovered increasing numbers of previously uncharacterized cell types in complex tissues. Thus, synthesizing growing knowledge of cellular characteristics is critical for dissecting cellular heterogeneity, developmental processes and tumorigenesis at single-cell resolution. Here, we present Cell Taxonomy (https://ngdc.cncb.ac.cn/celltaxonomy), a comprehensive and curated repository of cell types and associated cell markers encompassing a wide range of species, tissues and conditions. Combined with literature curation and data integration, the current version of Cell Taxonomy establishes a well-structured taxonomy for 3,143 cell types and houses a comprehensive collection of 26,613 associated cell markers in 257 conditions and 387 tissues across 34 species. Based on 4,299 publications and single-cell transcriptomic profiles of ∼3.5 million cells, Cell Taxonomy features multifaceted characterization for cell types and cell markers, involving quality assessment of cell markers and cell clusters, cross-species comparison, cell composition of tissues and cellular similarity based on markers. Taken together, Cell Taxonomy represents a fundamentally useful reference to systematically and accurately characterize cell types and thus lays an important foundation for deeply understanding and exploring cellular biology in diverse species.

References

  1. Elife. 2017 Dec 05;6: [PMID: 29206104]
  2. Cell. 2015 May 21;161(5):1202-1214 [PMID: 26000488]
  3. Database (Oxford). 2019 Jan 1;2019: [PMID: 30951143]
  4. Cell. 2019 Oct 31;179(4):829-845.e20 [PMID: 31675496]
  5. Gastroenterology. 2021 Mar;160(4):1330-1344.e11 [PMID: 33212097]
  6. Cell. 2021 Jun 24;184(13):3573-3587.e29 [PMID: 34062119]
  7. Science. 2022 May 13;376(6594):eabl4896 [PMID: 35549404]
  8. Nucleic Acids Res. 2013 Jan;41(Database issue):D991-5 [PMID: 23193258]
  9. Nat Biotechnol. 2018 Jun;36(5):411-420 [PMID: 29608179]
  10. Mol Cell. 2015 May 21;58(4):598-609 [PMID: 26000845]
  11. Nat Commun. 2017 Jan 16;8:14049 [PMID: 28091601]
  12. Nat Biotechnol. 2015 May;33(5):495-502 [PMID: 25867923]
  13. Nat Protoc. 2014 Jan;9(1):171-81 [PMID: 24385147]
  14. Genes Dev. 2011 Sep 15;25(18):1915-27 [PMID: 21890647]
  15. Science. 2015 Jan 23;347(6220):1260419 [PMID: 25613900]
  16. Nature. 2020 May;581(7808):303-309 [PMID: 32214235]
  17. Proc Natl Acad Sci U S A. 2018 Nov 13;115(46):E10988-E10997 [PMID: 30373828]
  18. Nat Commun. 2021 Sep 21;12(1):5556 [PMID: 34548483]
  19. Cell. 2021 Feb 4;184(3):792-809.e23 [PMID: 33545035]
  20. Science. 2018 Nov 30;362(6418):1060-1063 [PMID: 30498128]
  21. Nat Commun. 2020 Jul 10;11(1):3458 [PMID: 32651388]
  22. Nucleic Acids Res. 2019 Sep 5;47(15):7842-7856 [PMID: 31350901]
  23. Mol Cell. 2015 May 21;58(4):610-20 [PMID: 26000846]
  24. Science. 2022 Jun 3;376(6597):eabo0510 [PMID: 35549310]
  25. Science. 2022 May 13;376(6594):eabl4290 [PMID: 35549429]
  26. Nat Commun. 2020 Jun 22;11(1):3155 [PMID: 32572028]
  27. Nucleic Acids Res. 2019 Jan 8;47(D1):D721-D728 [PMID: 30289549]
  28. Genome Biol. 2012 Jan 31;13(1):R5 [PMID: 22293552]
  29. Nature. 2021 Feb;590(7846):473-479 [PMID: 33408417]
  30. J Biomed Semantics. 2016 Jul 04;7(1):44 [PMID: 27377652]
  31. iScience. 2020 Mar 27;23(3):100882 [PMID: 32062421]
  32. Cell. 2020 Apr 16;181(2):442-459.e29 [PMID: 32302573]
  33. Nucleic Acids Res. 2021 Jan 8;49(D1):D412-D419 [PMID: 33125078]
  34. Comput Struct Biotechnol J. 2021 Jan 19;19:961-969 [PMID: 33613863]
  35. BMC Bioinformatics. 2020 Aug 4;21(1):342 [PMID: 32753029]
  36. Nucleic Acids Res. 2013 Jan;41(Database issue):D1241-50 [PMID: 23203874]
  37. Cell. 2018 Feb 22;172(5):1091-1107.e17 [PMID: 29474909]
  38. Cell. 2019 Jun 13;177(7):1888-1902.e21 [PMID: 31178118]
  39. Science. 2022 May 13;376(6594):eabl5197 [PMID: 35549406]
  40. Nucleic Acids Res. 2021 Jan 8;49(D1):D480-D489 [PMID: 33237286]
  41. Exp Mol Med. 2018 Aug 7;50(8):1-14 [PMID: 30089861]
  42. Nucleic Acids Res. 2022 Jan 7;50(D1):D1255-D1261 [PMID: 34755882]
  43. Nucleic Acids Res. 2022 Jan 7;50(D1):D988-D995 [PMID: 34791404]
  44. Nature. 2018 Oct;562(7727):367-372 [PMID: 30283141]
  45. Cell Stem Cell. 2018 Aug 02;23(2):166-179 [PMID: 29754780]
  46. Nucleic Acids Res. 2014 Jan;42(Database issue):D950-8 [PMID: 24304896]
  47. Nucleic Acids Res. 2018 Jan 4;46(D1):D1074-D1082 [PMID: 29126136]
  48. Database (Oxford). 2011 Oct 29;2011:bar046 [PMID: 22039163]
  49. Nat Methods. 2017 Apr;14(4):395-398 [PMID: 28192419]
  50. Genomics Proteomics Bioinformatics. 2021 Jun;19(3):343-345 [PMID: 34923125]
  51. Nat Rev Nephrol. 2018 Aug;14(8):479-492 [PMID: 29789704]

Grants

  1. XDA19050302/Chinese Academy of Sciences
  2. 2018134/Youth Innovation Promotion Association of the Chinese Academy of Sciences
  3. 2020YFA0907001/National Key Research and Development Program of China
  4. 153F11KYSB20160008/Chinese Academy of Sciences
  5. 32100520/National Natural Science Foundation of China
  6. /The Open Biodiversity and Health Big Data Programme of IUBS

Links to CNCB-NGDC Resources

Database Commons: DBC007421 (Cell Taxonomy)

Word Cloud

Similar Articles

Cited By