ImageDTA: A Simple Model for Drug-Target Binding Affinity Prediction.

Li Han, Ling Kang, Quan Guo
Author Information
  1. Li Han: Software and Big Data Technology Department, Dalian Neusoft University of Information, Dalian, Liaoning 116023, China. ORCID
  2. Ling Kang: Neusoft Research Institute, Dalian Neusoft University of Information, Dalian, Liaoning 116023, China. ORCID
  3. Quan Guo: Neusoft Research Institute, Dalian Neusoft University of Information, Dalian, Liaoning 116023, China. ORCID

Abstract

Predicting the drug-target binding affinity (DTA) is crucial in drug discovery, and an increasing number of researchers are using artificial intelligence techniques to make such predictions. Many effective deep neural network prediction models have been proposed. However, current methods need improvement in accuracy, complexity, and efficiency. In this study, we propose a method based on a multiscale 2-dimensional convolutional neural network (CNN), namely ImageDTA. Many studies have shown that CNN achieves good learning effects with limited data. Therefore, we take a unique perspective by treating the word vector encoded with a simplified molecular input line entry system (SMILES) string as an "image" and processing it like handling images, fully leveraging the efficient processing capabilities of CNN for image data. Furthermore, we show that ImageDTA has higher training and inference efficiency than pretrained large models and outperforms attention-based graph neural network models in accuracy and interpretability. We also use visualization techniques to select appropriate convolutional kernel sizes, thereby increasing the network's interpretability.

References

  1. Bioinformatics. 2018 Sep 1;34(17):i821-i829 [PMID: 30423097]
  2. Bioinformatics. 2023 Jun 1;39(6): [PMID: 37225408]
  3. Nat Biotechnol. 2011 Oct 30;29(11):1046-51 [PMID: 22037378]
  4. J Chem Inf Model. 2014 Mar 24;54(3):735-43 [PMID: 24521231]
  5. Bioinformatics. 2024 Jan 2;40(1): [PMID: 38141210]
  6. Clin Transl Sci. 2018 Nov;11(6):597-606 [PMID: 29940695]
  7. Brief Bioinform. 2022 Jan 17;23(1): [PMID: 34929738]
  8. IEEE J Biomed Health Inform. 2024 Jan 08;PP: [PMID: 38190664]
  9. J Nat Prod. 2020 Mar 27;83(3):770-803 [PMID: 32162523]
  10. Bioinformatics. 2021 May 23;37(8):1140-1147 [PMID: 33119053]
  11. Bioinformatics. 2020 Nov 1;36(17):4633-4642 [PMID: 32462178]
  12. Comput Biol Med. 2023 Sep;164:107372 [PMID: 37597410]
  13. Sci Rep. 2018 May 15;8(1):7598 [PMID: 29765058]

Word Cloud

Created with Highcharts 10.0.0neuralnetworkmodelsCNNincreasingtechniquesManyaccuracyefficiencyconvolutionalImageDTAdataprocessinginterpretabilityPredictingdrug-targetbindingaffinityDTAcrucialdrugdiscoverynumberresearchersusingartificialintelligencemakepredictionseffectivedeeppredictionproposedHowevercurrentmethodsneedimprovementcomplexitystudyproposemethodbasedmultiscale2-dimensionalnamelystudiesshownachievesgoodlearningeffectslimitedThereforetakeuniqueperspectivetreatingwordvectorencodedsimplifiedmolecularinputlineentrysystemSMILESstring"image"likehandlingimagesfullyleveragingefficientcapabilitiesimageFurthermoreshowhighertraininginferencepretrainedlargeoutperformsattention-basedgraphalsousevisualizationselectappropriatekernelsizestherebynetwork'sImageDTA:SimpleModelDrug-TargetBindingAffinityPrediction

Similar Articles

Cited By