LINEAGE: Label-free identification of endogenous informative single-cell mitochondrial RNA mutation for lineage analysis.

Li Lin, Yufeng Zhang, Weizhou Qian, Yao Liu, Yingkun Zhang, Fanghe Lin, Cenxi Liu, Guangxing Lu, Di Sun, Xiaoxu Guo, YanLing Song, Jia Song, Chaoyong Yang, Jin Li
Author Information
  1. Li Lin: State Key Laboratory for Physical Chemistry of Solid Surfaces, Key Laboratory for Chemical Biology of Fujian Province, Key Laboratory of Analytical Chemistry, and Department of Chemical Biology, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, People's Republic of China. ORCID
  2. Yufeng Zhang: State Key Laboratory of Genetic Engineering and School of Life Sciences, Fudan University, Shanghai 200433, China.
  3. Weizhou Qian: State Key Laboratory for Physical Chemistry of Solid Surfaces, Key Laboratory for Chemical Biology of Fujian Province, Key Laboratory of Analytical Chemistry, and Department of Chemical Biology, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, People's Republic of China. ORCID
  4. Yao Liu: Department of Endocrinology and Metabolism, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai 200072, China.
  5. Yingkun Zhang: State Key Laboratory for Physical Chemistry of Solid Surfaces, Key Laboratory for Chemical Biology of Fujian Province, Key Laboratory of Analytical Chemistry, and Department of Chemical Biology, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, People's Republic of China. ORCID
  6. Fanghe Lin: State Key Laboratory for Physical Chemistry of Solid Surfaces, Key Laboratory for Chemical Biology of Fujian Province, Key Laboratory of Analytical Chemistry, and Department of Chemical Biology, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, People's Republic of China.
  7. Cenxi Liu: State Key Laboratory of Genetic Engineering and School of Life Sciences, Fudan University, Shanghai 200433, China.
  8. Guangxing Lu: State Key Laboratory of Genetic Engineering and School of Life Sciences, Fudan University, Shanghai 200433, China.
  9. Di Sun: Institute of Molecular Medicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China.
  10. Xiaoxu Guo: State Key Laboratory for Physical Chemistry of Solid Surfaces, Key Laboratory for Chemical Biology of Fujian Province, Key Laboratory of Analytical Chemistry, and Department of Chemical Biology, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, People's Republic of China.
  11. YanLing Song: State Key Laboratory for Physical Chemistry of Solid Surfaces, Key Laboratory for Chemical Biology of Fujian Province, Key Laboratory of Analytical Chemistry, and Department of Chemical Biology, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, People's Republic of China. ORCID
  12. Jia Song: Institute of Molecular Medicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China; li_jin_lifescience@fudan.edu.cn songjiajia2010@shsmu.edu.cn cyyang@xmu.edu.cn.
  13. Chaoyong Yang: State Key Laboratory for Physical Chemistry of Solid Surfaces, Key Laboratory for Chemical Biology of Fujian Province, Key Laboratory of Analytical Chemistry, and Department of Chemical Biology, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, People's Republic of China; li_jin_lifescience@fudan.edu.cn songjiajia2010@shsmu.edu.cn cyyang@xmu.edu.cn. ORCID
  14. Jin Li: State Key Laboratory of Genetic Engineering and School of Life Sciences, Fudan University, Shanghai 200433, China; li_jin_lifescience@fudan.edu.cn songjiajia2010@shsmu.edu.cn cyyang@xmu.edu.cn. ORCID

Abstract

Single-cell RNA-sequencing (scRNA-seq) has become a powerful tool for biomedical research by providing a variety of valuable information with the advancement of computational tools. Lineage analysis based on scRNA-seq provides key insights into the fate of individual cells in various systems. However, such analysis is limited by several technical challenges. On top of the considerable computational expertise and resources, these analyses also require specific types of matching data such as exogenous barcode information or bulk assay for transposase-accessible chromatin with high throughput sequencing (ATAC-seq) data. To overcome these technical challenges, we developed a user-friendly computational algorithm called "LINEAGE" (label-free identification of endogenous informative single-cell mitochondrial RNA mutation for lineage analysis). Aiming to screen out endogenous markers of lineage located on mitochondrial reads from label-free scRNA-seq data to conduct lineage inference, LINEAGE integrates a marker selection strategy by feature subspace separation and de novo "low cross-entropy subspaces" identification. In this process, the mutation type and subspace-subspace "cross-entropy" of features were both taken into consideration. LINEAGE outperformed three other methods, which were designed for similar tasks as testified with two standard datasets in terms of biological accuracy and computational efficiency. Applied on a label-free scRNA-seq dataset of BRAF-mutated cancer cells, LINEAGE also revealed genes that contribute to BRAF inhibitor resistance. LINEAGE removes most of the technical hurdles of lineage analysis, which will remarkably accelerate the discovery of the important genes or cell-lineage clusters from scRNA-seq data.

Keywords

References

  1. Stat Methods Med Res. 2017 Dec;26(6):2800-2811 [PMID: 26475830]
  2. Nature. 2014 Sep 18;513(7518):422-425 [PMID: 25043003]
  3. Nat Metab. 2020 Dec;2(12):1443-1458 [PMID: 33257854]
  4. Science. 2015 Jan 23;347(6220):400-6 [PMID: 25477212]
  5. Cell Syst. 2021 Aug 18;12(8):810-826.e4 [PMID: 34146472]
  6. Nucleic Acids Res. 2021 Feb 22;49(3):e18 [PMID: 33305325]
  7. Bioinformatics. 2013 Jan 1;29(1):15-21 [PMID: 23104886]
  8. Nat Rev Genet. 2020 Jul;21(7):410-427 [PMID: 32235876]
  9. Genome Res. 2018 Sep;28(9):1353-1363 [PMID: 30061114]
  10. Life Sci Alliance. 2021 Jun 18;4(8): [PMID: 34145025]
  11. Science. 2021 Feb 26;371(6532): [PMID: 33479121]
  12. Chem Commun (Camb). 2006 Feb 14;(6):629-31 [PMID: 16446832]
  13. Nat Methods. 2013 Nov;10(11):1096-8 [PMID: 24056875]
  14. Cell. 2019 Mar 7;176(6):1325-1339.e22 [PMID: 30827679]
  15. Science. 2020 Jan 3;367(6473):45-51 [PMID: 31806696]
  16. Neuron. 2015 Jan 7;85(1):49-59 [PMID: 25569347]
  17. Cell. 2021 Jun 24;184(13):3573-3587.e29 [PMID: 34062119]
  18. Genome Biol. 2020 Jul 15;21(1):174 [PMID: 32669109]
  19. Nutrients. 2018 Sep 29;10(10): [PMID: 30274245]
  20. Semin Cancer Biol. 2019 Dec;59:187-207 [PMID: 31362075]
  21. Nat Biotechnol. 2018 Dec 03;: [PMID: 30531897]
  22. Science. 2020 Feb 14;367(6479): [PMID: 31974159]
  23. BMC Bioinformatics. 2011 Mar 17;12:77 [PMID: 21414208]
  24. Bioinformatics. 2011 Nov 1;27(21):2987-93 [PMID: 21903627]
  25. Nature. 2013 Jun 6;498(7452):109-12 [PMID: 23685455]
  26. Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50 [PMID: 16199517]
  27. Nat Methods. 2017 Apr;14(4):414-416 [PMID: 28263960]
  28. Mitochondrion. 2015 Jan;20:82-94 [PMID: 25526677]
  29. Front Cell Dev Biol. 2020 Dec 01;8:593007 [PMID: 33335900]
  30. Cell. 2018 Feb 22;172(5):1091-1107.e17 [PMID: 29474909]
  31. PLoS Comput Biol. 2005 Oct;1(5):e50 [PMID: 16261192]
  32. Cell. 2019 Jun 13;177(7):1888-1902.e21 [PMID: 31178118]
  33. Cell. 2009 Aug 7;138(3):449-62 [PMID: 19665969]
  34. Nucleic Acids Res. 2019 Jun 4;47(10):e56 [PMID: 30820578]
  35. Science. 2021 Feb 26;371(6532): [PMID: 33632818]
  36. Science. 2015 Oct 2;350(6256):94-98 [PMID: 26430121]
  37. Cell Res. 2018 Jul;28(7):730-745 [PMID: 29867213]

MeSH Term

Algorithms
Animals
Cell Lineage
Cluster Analysis
Gene Expression Profiling
High-Throughput Nucleotide Sequencing
Humans
Mutation
RNA
RNA, Mitochondrial
Sequence Analysis, RNA
Single-Cell Analysis
Exome Sequencing

Chemicals

RNA, Mitochondrial
RNA

Word Cloud

Similar Articles

Cited By