Host-Virus Chimeric Events in SARS-CoV-2-Infected Cells Are Infrequent and Artifactual.

Bingyu Yan, Srishti Chakravorty, Carmen Mirabelli, Luopin Wang, Jorge L Trujillo-Ochoa, Daniel Chauss, Dhaneshwar Kumar, Michail S Lionakis, Matthew R Olson, Christiane E Wobus, Behdad Afzali, Majid Kazemian
Author Information
  1. Bingyu Yan: Department of Biochemistry, Purdue University, West Lafayette, Indiana, USA.
  2. Srishti Chakravorty: Department of Biochemistry, Purdue University, West Lafayette, Indiana, USA.
  3. Carmen Mirabelli: Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, USA.
  4. Luopin Wang: Department of Computer Science, Purdue University, West Lafayette, Indiana, USA.
  5. Jorge L Trujillo-Ochoa: Immunoregulation Section, Kidney Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), NIH, Bethesda, Maryland, USA.
  6. Daniel Chauss: Immunoregulation Section, Kidney Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), NIH, Bethesda, Maryland, USA.
  7. Dhaneshwar Kumar: Department of Biochemistry, Purdue University, West Lafayette, Indiana, USA.
  8. Michail S Lionakis: Fungal Pathogenesis Section, Laboratory of Clinical Immunology and Microbiology, National Institute of Allergy and Infectious Diseases (NIAID), NIH, Bethesda, Maryland, USA.
  9. Matthew R Olson: Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA.
  10. Christiane E Wobus: Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan, USA.
  11. Behdad Afzali: Immunoregulation Section, Kidney Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), NIH, Bethesda, Maryland, USA.
  12. Majid Kazemian: Department of Biochemistry, Purdue University, West Lafayette, Indiana, USA.

Abstract

The pathogenic mechanisms underlying severe SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) infection remain largely unelucidated. High-throughput sequencing technologies that capture genome and transcriptome information are key approaches to gain detailed mechanistic insights from infected cells. These techniques readily detect both pathogen- and host-derived sequences, providing a means of studying host-pathogen interactions. Recent studies have reported the presence of host-virus chimeric (HVC) RNA in transcriptome sequencing (RNA-seq) data from SARS-CoV-2-infected cells and interpreted these findings as evidence of viral integration in the human genome as a potential pathogenic mechanism. Since SARS-CoV-2 is a positive-sense RNA virus that replicates in the cytoplasm, it does not have a nuclear phase in its life cycle. Thus, it is biologically unlikely to be in a location where splicing events could result in genome integration. Therefore, we investigated the biological authenticity of HVC events. In contrast to true biological events like mRNA splicing and genome rearrangement events, which generate reproducible chimeric sequencing fragments across different biological isolates, we found that HVC events across >100 RNA-seq libraries from patients with coronavirus disease 2019 (COVID-19) and infected cell lines were highly irreproducible. RNA-seq library preparation is inherently error prone due to random template switching during reverse transcription of RNA to cDNA. By counting chimeric events observed when constructing an RNA-seq library from human RNA and spiked-in RNA from an unrelated species, such as the fruit fly, we estimated that ∼1% of RNA-seq reads are artifactually chimeric. In SARS-CoV-2 RNA-seq, we found that the frequency of HVC events was, in fact, not greater than this background "noise." Finally, we developed a novel experimental approach to enrich SARS-CoV-2 sequences from bulk RNA of infected cells. This method enriched viral sequences but did not enrich HVC events, suggesting that the majority of HVC events are, in all likelihood, artifacts of library construction. In conclusion, our findings indicate that HVC events observed in RNA-sequencing libraries from SARS-CoV-2-infected cells are extremely rare and are likely artifacts arising from random template switching of reverse transcriptase and/or sequence alignment errors. Therefore, the observed HVC events do not support SARS-CoV-2 fusion to cellular genes and/or integration into human genomes. The pathogenic mechanisms underlying SARS-CoV-2, the virus responsible for COVID-19, are not fully understood. In particular, relatively little is known about the reasons some individuals develop life-threatening or persistent COVID-19. Recent studies identified host-virus chimeric (HVC) reads in RNA-sequencing data from SARS-CoV-2-infected cells and suggested that HVC events support potential "human genome invasion" and "integration" by SARS-CoV-2. This suggestion has fueled concerns about the long-term effects of current mRNA vaccines that incorporate elements of the viral genome. SARS-CoV-2 is a positive-sense, single-stranded RNA virus that does not encode a reverse transcriptase and does not include a nuclear phase in its life cycle, so some doubts have rightfully been expressed regarding the authenticity of HVCs and the role played by endogenous retrotransposons in this phenomenon. Thus, it is important to independently authenticate these HVC events. Here, we provide several lines of evidence suggesting that the observed HVC events are likely artifactual.

Keywords

References

  1. Lancet Respir Med. 2020 Apr;8(4):420-422 [PMID: 32085846]
  2. Cell. 2020 May 14;181(4):914-921.e10 [PMID: 32330414]
  3. Nat Commun. 2016 Oct 05;7:12992 [PMID: 27703150]
  4. Genomics. 2019 Dec;111(6):1853-1861 [PMID: 30552977]
  5. PLoS Pathog. 2017 Apr 6;13(4):e1006211 [PMID: 28384274]
  6. Nat Commun. 2020 Dec 9;11(1):6319 [PMID: 33298930]
  7. Genome Res. 2016 Sep;26(9):1277-87 [PMID: 27365365]
  8. Nat Genet. 2012 May 27;44(7):765-9 [PMID: 22634754]
  9. Proc Natl Acad Sci U S A. 2011 May 31;108(22):9172-7 [PMID: 21571633]
  10. Proc Natl Acad Sci U S A. 2019 Apr 23;116(17):8445-8450 [PMID: 30962378]
  11. Nature. 2021 Jan;589(7841):270-275 [PMID: 33116299]
  12. Genomics. 2006 Jul;88(1):127-31 [PMID: 16457984]
  13. Genome Biol. 2016 Dec 30;17(1):266 [PMID: 28038678]
  14. iScience. 2021 Mar 19;24(3):102151 [PMID: 33585804]
  15. Cell. 2020 Jun 25;181(7):1502-1517.e23 [PMID: 32559462]
  16. J Virol. 2015 Sep;89(17):8967-73 [PMID: 26085148]
  17. Mol Syst Biol. 2015 Aug 07;11(8):826 [PMID: 26253570]
  18. Sci Immunol. 2021 Apr 7;6(58): [PMID: 33827897]
  19. Cell. 2020 Nov 25;183(5):1325-1339.e21 [PMID: 33080218]
  20. Theranostics. 2020 Sep 1;10(24):10957-10972 [PMID: 33042264]
  21. Front Cell Infect Microbiol. 2021 Feb 25;11:609160 [PMID: 33732659]
  22. J Virol. 2020 Nov 23;94(24): [PMID: 32999023]
  23. Cell. 2020 May 28;181(5):1036-1045.e9 [PMID: 32416070]
  24. Virology. 2017 Sep;509:167-177 [PMID: 28646652]
  25. Annu Rev Microbiol. 2019 Sep 8;73:529-557 [PMID: 31226023]
  26. Cancer Res. 2019 Dec 1;79(23):6010-6023 [PMID: 31481499]

Grants

  1. ZIA/AI001175/HHS | NIH | National Institute of Allergy and Infectious Diseases (NIAID)
  2. ZIA/DK075149/HHS | NIH | National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)
  3. R35GM138283/HHS | NIH | National Institute of General Medical Sciences (NIGMS)
  4. ZIA DK075149/Intramural NIH HHS
  5. UL1 TR002240/NCATS NIH HHS
  6. Individual Fellowship 841247/Marie Skłodowska-Curie Actions
  7. ZIA AI001175/Intramural NIH HHS
  8. /Office of Dietary Supplements (ODS)
  9. UL1TR002240/MICHR Postdoc Translational Scholar Program
  10. R35 GM138283/NIGMS NIH HHS

MeSH Term

COVID-19
Cell Line, Tumor
Host-Pathogen Interactions
Humans
RNA, Viral
RNA-Seq
SARS-CoV-2
Virus Replication

Chemicals

RNA, Viral