Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces.

Zheng Xia, Ling-Yun Wu, Xiaobo Zhou, Stephen T C Wong
Author Information
  1. Zheng Xia: Bioinformatics and Bioengineering Program, The Methodist Hospital Research Institute, Weill Medical College, Cornell University, Houston, TX 77030, USA. zxia@tmhs.org

Abstract

BACKGROUND: Predicting drug-protein interactions from heterogeneous biological data sources is a key step for in silico drug discovery. The difficulty of this prediction task lies in the rarity of known drug-protein interactions and myriad unknown interactions to be predicted. To meet this challenge, a manifold regularization semi-supervised learning method is presented to tackle this issue by using labeled and unlabeled information which often generates better results than using the labeled data alone. Furthermore, our semi-supervised learning method integrates known drug-protein interaction network information as well as chemical structure and genomic sequence data.
RESULTS: Using the proposed method, we predicted certain drug-protein interactions on the enzyme, ion channel, GPCRs, and nuclear receptor data sets. Some of them are confirmed by the latest publicly available drug targets databases such as KEGG.
CONCLUSIONS: We report encouraging results of using our method for drug-protein interaction network reconstruction which may shed light on the molecular interaction inference and new uses of marketed drugs.

References

  1. Nature. 2009 Nov 12;462(7270):175-81 [PMID: 19881490]
  2. Nat Biotechnol. 2007 Feb;25(2):197-206 [PMID: 17287757]
  3. Curr Opin Chem Biol. 2002 Aug;6(4):439-46 [PMID: 12133718]
  4. Science. 2008 Jul 11;321(5886):263-6 [PMID: 18621671]
  5. Bioinformatics. 2009 Sep 15;25(18):2397-403 [PMID: 19605421]
  6. Bioinformatics. 2008 Oct 1;24(19):2149-56 [PMID: 18676415]
  7. J Am Chem Soc. 2003 Oct 1;125(39):11853-65 [PMID: 14505407]
  8. Nat Biotechnol. 2007 Oct;25(10):1119-26 [PMID: 17921997]
  9. Curr Opin Drug Discov Devel. 2001 Sep;4(5):561-74 [PMID: 12825452]
  10. Nucleic Acids Res. 2006 Jan 1;34(Database issue):D354-7 [PMID: 16381885]
  11. Genome Res. 2008 Feb;18(2):206-13 [PMID: 18083776]
  12. Nucleic Acids Res. 2006 Jan 1;34(Database issue):D668-72 [PMID: 16381955]
  13. Bioinformatics. 2005 Oct 15;21(20):3940-1 [PMID: 16096348]
  14. Bioinformatics. 2008 Jul 1;24(13):i232-40 [PMID: 18586719]

MeSH Term

Artificial Intelligence
Databases, Genetic
Drug Discovery
Drug Interactions
Genome, Human
Humans
Protein Interaction Mapping
Sequence Analysis
Systems Biology
Systems Integration

Word Cloud

Created with Highcharts 10.0.0drug-proteininteractionsdatamethodinteractionusingheterogeneousbiologicaldrugpredictionknownpredictedsemi-supervisedlearninglabeledinformationresultsnetworkBACKGROUND:PredictingsourceskeystepsilicodiscoverydifficultytaskliesraritymyriadunknownmeetchallengemanifoldregularizationpresentedtackleissueunlabeledoftengeneratesbetteraloneFurthermoreintegrateswellchemicalstructuregenomicsequenceRESULTS:UsingproposedcertainenzymeionchannelGPCRsnuclearreceptorsetsconfirmedlatestpubliclyavailabletargetsdatabasesKEGGCONCLUSIONS:reportencouragingreconstructionmayshedlightmolecularinferencenewusesmarketeddrugsSemi-supervisedspaces

Similar Articles

Cited By