Deep-Learning Resources for Studying Glycan-Mediated Host-Microbe Interactions.

Daniel Bojar, Rani K Powers, Diogo M Camacho, James J Collins
Author Information
  1. Daniel Bojar: Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Department of Biological Engineering and Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
  2. Rani K Powers: Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Department of Biological Engineering and Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
  3. Diogo M Camacho: Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA. Electronic address: diogo.camacho@wyss.harvard.edu.
  4. James J Collins: Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Department of Biological Engineering and Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA. Electronic address: jimjc@mit.edu.

Abstract

Glycans, the most diverse biopolymer, are shaped by evolutionary pressures stemming from host-microbe interactions. Here, we present machine learning and bioinformatics methods to leverage the evolutionary information present in glycans to gain insights into how pathogens and commensals interact with hosts. By using techniques from natural language processing, we develop deep-learning models for glycans that are trained on a curated dataset of 19,299 unique glycans and can be used to study and predict glycan functions. We show that these models can be utilized to predict glycan immunogenicity and the pathogenicity of bacterial strains, as well as investigate glycan-mediated immune evasion via molecular mimicry. We also develop glycan-alignment methods and use these to analyze virulence-determining glycan motifs in the capsular polysaccharides of bacterial pathogens. These resources enable one to identify and study glycan motifs involved in immunogenicity, pathogenicity, molecular mimicry, and immune evasion, expanding our understanding of host-microbe interactions.

Keywords

MeSH Term

Animals
Bacteria
Bacterial Capsules
Bacterial Physiological Phenomena
Computational Biology
Deep Learning
Host Microbial Interactions
Humans
Immune Evasion
Natural Language Processing
Polysaccharides
Polysaccharides, Bacterial
Symbiosis
Virulence

Chemicals

Polysaccharides
Polysaccharides, Bacterial

Word Cloud

Created with Highcharts 10.0.0glycansglycanhost-microbelearningevolutionaryinteractionspresentmachinebioinformaticsmethodspathogensdevelopmodelscanstudypredictimmunogenicitypathogenicitybacterialimmuneevasionmolecularmimicrymotifsGlycansdiversebiopolymershapedpressuresstemmingleverageinformationgaininsightscommensalsinteracthostsusingtechniquesnaturallanguageprocessingdeep-learningtrainedcurateddataset19299uniqueusedfunctionsshowutilizedstrainswellinvestigateglycan-mediatedviaalsoglycan-alignmentuseanalyzevirulence-determiningcapsularpolysaccharidesresourcesenableoneidentifyinvolvedexpandingunderstandingDeep-LearningResourcesStudyingGlycan-MediatedHost-MicrobeInteractionsdeepglycobiology

Similar Articles

Cited By