Drug-target affinity prediction with extended graph learning-convolutional networks.

Haiou Qi, Ting Yu, Wenwen Yu, Chenxi Liu
Author Information
  1. Haiou Qi: Nursing Department, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, 310016, China.
  2. Ting Yu: Operating Room Department, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, 310016, China. 3320108@zju.edu.cn.
  3. Wenwen Yu: School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, 430074, China.
  4. Chenxi Liu: School of Medicine and Health Management, Tongji Medical School, Huazhong University of Science and Technology, Wuhan, 430030, China.

Abstract

BACKGROUND: High-performance computing plays a pivotal role in computer-aided drug design, a field that holds significant promise in pharmaceutical research. The prediction of drug-target affinity (DTA) is a crucial stage in this process, potentially accelerating drug development through rapid and extensive preliminary compound screening, while also minimizing resource utilization and costs. Recently, the incorporation of deep learning into DTA prediction and the enhancement of its accuracy have emerged as key areas of interest in the research community. Drugs and targets can be characterized through various methods, including structure-based, sequence-based, and graph-based representations. Despite the progress in structure and sequence-based techniques, they tend to provide limited feature information. Conversely, graph-based approaches have risen to prominence, attracting considerable attention for their comprehensive data representation capabilities. Recent studies have focused on constructing protein and drug molecular graphs using sequences and SMILES, subsequently deriving representations through graph neural networks. However, these graph-based approaches are limited by the use of a fixed adjacent matrix of protein and drug molecular graphs for graph convolution. This limitation restricts the learning of comprehensive feature representations from intricate compound and protein structures, consequently impeding the full potential of graph-based feature representation in DTA prediction. This, in turn, significantly impacts the models' generalization capabilities in the complex realm of drug discovery.
RESULTS: To tackle these challenges, we introduce GLCN-DTA, a model specifically designed for proficiency in DTA tasks. GLCN-DTA innovatively integrates a graph learning module into the existing graph architecture. This module is designed to learn a soft adjacent matrix, which effectively and efficiently refines the contextual structure of protein and drug molecular graphs. This advancement allows for learning richer structural information from protein and drug molecular graphs via graph convolution, specifically tailored for DTA tasks, compared to the conventional fixed adjacent matrix approach. A series of experiments have been conducted to validate the efficacy of the proposed GLCN-DTA method across diverse scenarios. The results demonstrate that GLCN-DTA possesses advantages in terms of robustness and high accuracy.
CONCLUSIONS: The proposed GLCN-DTA model enhances DTA prediction performance by introducing a novel framework that synergizes graph learning operations with graph convolution operations, thereby achieving richer representations. GLCN-DTA does not distinguish between different protein classifications, including structurally ordered and intrinsically disordered proteins, focusing instead on improving feature representation. Therefore, its applicability scope may be more effective in scenarios involving structurally ordered proteins, while potentially being limited in contexts with intrinsically disordered proteins.

Keywords

References

  1. BMC Bioinformatics. 2024 Jan 4;25(1):10 [PMID: 38177981]
  2. RSC Adv. 2020 Jun 1;10(35):20701-20712 [PMID: 35517730]
  3. J Comput Chem. 2009 Dec;30(16):2785-91 [PMID: 19399780]
  4. J Chem Inf Model. 2022 Mar 14;62(5):1308-1317 [PMID: 35200015]
  5. BMC Bioinformatics. 2023 Sep 7;24(1):334 [PMID: 37679724]
  6. Curr Protein Pept Sci. 2018;19(5):468-478 [PMID: 27875970]
  7. Nat Biotechnol. 2007 Jan;25(1):71-5 [PMID: 17211405]
  8. Bioinformatics. 2014 Nov 1;30(21):3128-30 [PMID: 25064567]
  9. Comput Biol Med. 2023 Sep;163:107136 [PMID: 37329615]
  10. Comput Biol Med. 2023 Sep 20;166:107512 [PMID: 37788507]
  11. Nat Med. 2014 Jun;20(6):590-1 [PMID: 24901567]
  12. Nat Rev Drug Discov. 2004 Aug;3(8):673-83 [PMID: 15286734]
  13. BMC Genomics. 2022 Jun 17;23(1):449 [PMID: 35715739]
  14. BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):689 [PMID: 31874614]
  15. Bioinformatics. 2018 Sep 1;34(17):i821-i829 [PMID: 30423097]
  16. J Chem Inf Comput Sci. 2002 Nov-Dec;42(6):1273-80 [PMID: 12444722]
  17. Bioinformatics. 2012 Jan 15;28(2):184-90 [PMID: 22101153]
  18. Nat Biotechnol. 2007 Feb;25(2):197-206 [PMID: 17287757]
  19. Bioinformatics. 2021 May 23;37(8):1140-1147 [PMID: 33119053]
  20. BMC Bioinformatics. 2019 Sep 14;20(1):473 [PMID: 31521110]
  21. IEEE/ACM Trans Comput Biol Bioinform. 2022 Mar-Apr;19(2):718-728 [PMID: 34197324]
  22. J Health Econ. 2016 May;47:20-33 [PMID: 26928437]
  23. Adv Neural Inf Process Syst. 2019 Dec;32:9689-9701 [PMID: 33390682]
  24. J Cheminform. 2017 Apr 18;9(1):24 [PMID: 29086119]
  25. J Chem Inf Comput Sci. 2002 Nov-Dec;42(6):1332-42 [PMID: 12444729]
  26. J Cheminform. 2013 Jun 22;5(1):30 [PMID: 23800010]
  27. BMC Bioinformatics. 2022 Sep 7;23(1):367 [PMID: 36071406]
  28. Methods. 2017 Oct 1;129:81-88 [PMID: 28549952]
  29. Chem Sci. 2022 Jan 5;13(3):816-833 [PMID: 35173947]
  30. Bioinformatics. 2021 Apr 1;36(22-23):5545-5547 [PMID: 33275143]
  31. Nat Rev Drug Discov. 2008 Oct;7(10):807-17 [PMID: 18806753]
  32. Nat Biotechnol. 2011 Oct 30;29(11):1046-51 [PMID: 22037378]
  33. Nat Chem Biol. 2011 Apr;7(4):200-2 [PMID: 21336281]
  34. Bioinformatics. 2020 Jan 1;36(1):41-48 [PMID: 31173061]
  35. J Chem Inf Model. 2018 Jan 22;58(1):27-35 [PMID: 29268609]
  36. Biochem Biophys Res Commun. 2003 Oct 3;309(4):923-8 [PMID: 13679062]
  37. Drug Discov Today. 2013 Feb;18(3-4):110-5 [PMID: 22935104]
  38. Curr Drug Targets. 2017;18(9):1104-1111 [PMID: 27848884]
  39. Science. 2018 Apr 13;360(6385):186-190 [PMID: 29449509]
  40. J Chem Inf Model. 2010 May 24;50(5):742-54 [PMID: 20426451]
  41. Brief Bioinform. 2022 Nov 19;23(6): [PMID: 36411674]
  42. J Chem Inf Model. 2014 Mar 24;54(3):735-43 [PMID: 24521231]
  43. Mol Biosyst. 2012 Jul 6;8(7):1970-8 [PMID: 22538619]
  44. Brief Bioinform. 2016 Jul;17(4):696-712 [PMID: 26283676]
  45. ACS Cent Sci. 2017 Apr 26;3(4):283-293 [PMID: 28470045]
  46. Brief Bioinform. 2015 Mar;16(2):325-37 [PMID: 24723570]
  47. ACS Omega. 2020 Nov 19;5(47):30625-30632 [PMID: 33283111]
  48. J Chem Inf Model. 2016 Dec 27;56(12):2495-2506 [PMID: 28024405]
  49. RNA. 2009 Jun;15(6):1219-30 [PMID: 19369428]
  50. J Chem Inf Comput Sci. 2004 Sep-Oct;44(5):1708-18 [PMID: 15446830]
  51. Bioinformatics. 2020 Aug 15;36(16):4406-4414 [PMID: 32428219]
  52. BMC Bioinformatics. 2023 Sep 30;24(1):367 [PMID: 37777712]
  53. Biomolecules. 2021 Jul 29;11(8): [PMID: 34439785]
  54. Bioinformatics. 2019 Jan 1;35(1):104-111 [PMID: 30561548]

Grants

  1. 2024KY1128/Zhejiang Province Medical and Health Science and Technology Plan Project - General Project for Clinical Observation Research
  2. 2024HLKY09/General Project for Nursing Clinical Research at the Sir Run Run Shaw Hospital affiliated with the Zhejiang University School of Medicine

MeSH Term

Intrinsically Disordered Proteins
Drug Development
Drug Discovery
Drug Delivery Systems
Drug Design

Chemicals

Intrinsically Disordered Proteins

Word Cloud

Created with Highcharts 10.0.0graphdrugpredictionDTAlearningproteinGLCN-DTAgraph-basedrepresentationsfeaturemoleculargraphsaffinitylimitedrepresentationnetworksadjacentmatrixconvolutionproteinsresearchpotentiallycompoundaccuracyincludingsequence-basedstructureinformationapproachescomprehensivecapabilitiesfixeddiscoverymodelspecificallydesignedtasksmodulericherproposedscenariosoperationsstructurallyorderedintrinsicallydisorderedlearning-convolutionalBACKGROUND:High-performancecomputingplayspivotalrolecomputer-aideddesignfieldholdssignificantpromisepharmaceuticaldrug-targetcrucialstageprocessacceleratingdevelopmentrapidextensivepreliminaryscreeningalsominimizingresourceutilizationcostsRecentlyincorporationdeepenhancementemergedkeyareasinterestcommunityDrugstargetscancharacterizedvariousmethodsstructure-basedDespiteprogresstechniquestendprovideConverselyrisenprominenceattractingconsiderableattentiondataRecentstudiesfocusedconstructingusingsequencesSMILESsubsequentlyderivingneuralHoweveruselimitationrestrictsintricatestructuresconsequentlyimpedingfullpotentialturnsignificantlyimpactsmodels'generalizationcomplexrealmRESULTS:tacklechallengesintroduceproficiencyinnovativelyintegratesexistingarchitecturelearnsofteffectivelyefficientlyrefinescontextualadvancementallowsstructuralviatailoredcomparedconventionalapproachseriesexperimentsconductedvalidateefficacymethodacrossdiverseresultsdemonstratepossessesadvantagestermsrobustnesshighCONCLUSIONS:enhancesperformanceintroducingnovelframeworksynergizestherebyachievingdistinguishdifferentclassificationsfocusinginsteadimprovingThereforeapplicabilityscopemayeffectiveinvolvingcontextsDrug-targetextendedDeepDrugDrug–targetGraph

Similar Articles

Cited By