SSR-DTA: Substructure-aware multi-layer graph neural networks for drug-target binding affinity prediction.

Yuansheng Liu, Xinyan Xia, Yongshun Gong, Bosheng Song, Xiangxiang Zeng
Author Information
  1. Yuansheng Liu: College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410086, Hunan, China; Key Laboratory of Intelligent Computing & Signal Processing of Ministry of Education, Anhui University, Hefei, 230601, Anhui, China.
  2. Xinyan Xia: College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410086, Hunan, China.
  3. Yongshun Gong: School of Software, Shandong University, Jinan, 250100, Shandong, China.
  4. Bosheng Song: College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410086, Hunan, China.
  5. Xiangxiang Zeng: College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410086, Hunan, China. Electronic address: xzeng@hnu.edu.cn.

Abstract

Accurate prediction of drug-target binding affinity (DTA) is essential in the field of drug discovery. Recently, scientists have been attempting to utilize artificial intelligence prediction to screen out a significant number of ineffective compounds, thereby mitigating labor and financial losses. While graph neural networks (GNNs) have been applied to DTA, existing GNNs have limitations in effectively extracting substructural features across various sizes. Functional groups play a crucial role in modulating molecular properties, but existing GNNs struggle with feature extraction from certain motifs due to scale mismatches. Additionally, sequence-based models for target proteins lack the integration of structural information. To address these limitations, we present SSR-DTA, a multi-layer graph network capable of adapting to diverse structural sizes, which can extract richer biological features, thereby improving the robustness and accuracy of predictions. Multi-layer GNNs enable the capture of molecular motifs across different scales, ranging from atomic to macrocyclic motifs. Furthermore, we introduce BiGNN to simultaneously learn sequence and structural information. Sequence information corresponds to the primary structure of proteins, while graph information represents the tertiary structure. BiGNN assimilates richer information compared to sequence-based methods while mitigating the impact of errors from predicted structures, resulting in more accurate predictions. Through rigorous experimental evaluations conducted on four benchmark datasets, we demonstrate the superiority of SSR-DTA over state-of-the-art models. Particularly, in comparison to state-of-the-art models, SSR-DTA demonstrates an impressive 20% reduction in mean squared error on the Davis dataset and a 5% reduction on the KIBA dataset, underscoring its potential as a valuable tool for advancing DTA prediction.

Keywords

MeSH Term

Neural Networks, Computer
Drug Discovery
Proteins
Humans
Protein Binding

Chemicals

Proteins

Word Cloud

Created with Highcharts 10.0.0predictioninformationgraphGNNsaffinityDTAneuralnetworksmotifsmodelsstructuralSSR-DTAdrug-targetbindingtherebymitigatingexistinglimitationsfeaturesacrosssizesmolecularsequence-basedproteinsmulti-layerricherpredictionsBiGNNstructurestate-of-the-artreductiondatasetlearningAccurateessentialfielddrugdiscoveryRecentlyscientistsattemptingutilizeartificialintelligencescreensignificantnumberineffectivecompoundslaborfinanciallossesappliedeffectivelyextractingsubstructuralvariousFunctionalgroupsplaycrucialrolemodulatingpropertiesstrugglefeatureextractioncertainduescalemismatchesAdditionallytargetlackintegrationaddresspresentnetworkcapableadaptingdiversecanextractbiologicalimprovingrobustnessaccuracyMulti-layerenablecapturedifferentscalesrangingatomicmacrocyclicFurthermoreintroducesimultaneouslylearnsequenceSequencecorrespondsprimaryrepresentstertiaryassimilatescomparedmethodsimpacterrorspredictedstructuresresultingaccuraterigorousexperimentalevaluationsconductedfourbenchmarkdatasetsdemonstratesuperiorityParticularlycomparisondemonstratesimpressive20%meansquarederrorDavis5%KIBAunderscoringpotentialvaluabletooladvancingSSR-DTA:Substructure-awareDeepDrug–targetFeaturerepresentationGraph

Similar Articles

Cited By