Molecular property prediction based on graph structure learning.

Bangyi Zhao, Weixia Xu, Jihong Guan, Shuigeng Zhou
Author Information
  1. Bangyi Zhao: Shanghai Key Lab of Intelligent Information Processing, and School of Computer Science, Fudan University, Shanghai 200438, China. ORCID
  2. Weixia Xu: Shanghai Key Lab of Intelligent Information Processing, and School of Computer Science, Fudan University, Shanghai 200438, China.
  3. Jihong Guan: Department of Computer Science and Technology, Tongji University, Shanghai 201804, China.
  4. Shuigeng Zhou: Shanghai Key Lab of Intelligent Information Processing, and School of Computer Science, Fudan University, Shanghai 200438, China. ORCID

Abstract

MOTIVATION: Molecular property prediction (MPP) is a fundamental but challenging task in the computer-aided drug discovery process. More and more recent works employ different graph-based models for MPP, which have achieved considerable progress in improving prediction performance. However, current models often ignore relationships between molecules, which could be also helpful for MPP.
RESULTS: For this sake, in this article we propose a graph structure learning (GSL) based MPP approach, called GSL-MPP. Specifically, we first apply graph neural network (GNN) over molecular graphs to extract molecular representations. Then, with molecular fingerprints, we construct a molecule similarity graph (MSG). Following that, we conduct GSL on the MSG, i.e. molecule-level GSL, to get the final molecular embeddings, which are the results of fuzing both GNN encoded molecular representations and the relationships among molecules. That is, combining both intra-molecule and inter-molecule information. Finally, we use these molecular embeddings to perform MPP. Extensive experiments on 10 various benchmark datasets show that our method could achieve state-of-the-art performance in most cases, especially on classification tasks. Further visualization studies also demonstrate the good molecular representations of our method.
AVAILABILITY AND IMPLEMENTATION: Source code is available at https://github.com/zby961104/GSL-MPP.

References

  1. Brief Bioinform. 2020 May 21;21(3):919-935 [PMID: 31155636]
  2. Expert Opin Drug Discov. 2016;11(2):137-48 [PMID: 26558489]
  3. Drug Discov Today Technol. 2020 Dec;37:1-12 [PMID: 34895648]
  4. Nature. 2018 May;557(7707):S55-S57 [PMID: 29849160]
  5. Bioinformatics. 2023 Jan 1;39(1): [PMID: 36416124]
  6. Bioinformatics. 2022 Mar 28;38(7):2003-2009 [PMID: 35094072]
  7. Chem Sci. 2017 Oct 31;9(2):513-530 [PMID: 29629118]
  8. J Chem Inf Model. 2019 Aug 26;59(8):3370-3388 [PMID: 31361484]
  9. J Chem Inf Model. 2006 Jul-Aug;46(4):1535 [PMID: 16859285]
  10. J Med Chem. 2012 Apr 12;55(7):2932-42 [PMID: 22236250]
  11. J Comput Aided Mol Des. 2020 Sep;34(9):929-942 [PMID: 32367387]
  12. J Chem Inf Model. 2010 May 24;50(5):742-54 [PMID: 20426451]
  13. J Comput Aided Mol Des. 2016 Aug;30(8):595-608 [PMID: 27558503]
  14. J Med Chem. 2014 Jan 9;57(1):18-28 [PMID: 23981118]
  15. Int J Mol Sci. 2023 Jan 19;24(3): [PMID: 36768346]
  16. J Med Chem. 2020 Aug 27;63(16):8749-8760 [PMID: 31408336]

Grants

  1. 62372326/National Natural Science Foundation of China

MeSH Term

Neural Networks, Computer
Drug Discovery
Machine Learning
Algorithms

Word Cloud

Created with Highcharts 10.0.0molecularMPPgraphpredictionGSLrepresentationsMolecularpropertymodelsperformancerelationshipsmoleculesalsostructurelearningbasedGNNMSGembeddingsmethodMOTIVATION:fundamentalchallengingtaskcomputer-aideddrugdiscoveryprocessrecentworksemploydifferentgraph-basedachievedconsiderableprogressimprovingHowevercurrentoftenignorehelpfulRESULTS:sakearticleproposeapproachcalledGSL-MPPSpecificallyfirstapplyneuralnetworkgraphsextractfingerprintsconstructmoleculesimilarityFollowingconductiemolecule-levelgetfinalresultsfuzingencodedamongcombiningintra-moleculeinter-moleculeinformationFinallyuseperformExtensiveexperiments10variousbenchmarkdatasetsshowachievestate-of-the-artcasesespeciallyclassificationtasksvisualizationstudiesdemonstrategoodAVAILABILITYANDIMPLEMENTATION:Sourcecodeavailablehttps://githubcom/zby961104/GSL-MPP

Similar Articles

Cited By

No available data.