Supervised graph co-contrastive learning for drug-target interaction prediction.

Yang Li, Guanyu Qiao, Xin Gao, Guohua Wang
Author Information
  1. Yang Li: College of information and Computer Engineering, Northeast Forestry University, Harbin 150004, China. ORCID
  2. Guanyu Qiao: College of information and Computer Engineering, Northeast Forestry University, Harbin 150004, China.
  3. Xin Gao: Computer, Electrical and Mathematical Science and Engineering Division, King Abdullah University of Science and Technology, Mathematical and Computer Sciences and Engineering, Thuwal 23955, Kingdom of Saudi Arabia. ORCID
  4. Guohua Wang: College of information and Computer Engineering, Northeast Forestry University, Harbin 150004, China. ORCID

Abstract

MOTIVATION: Identification of Drug-Target Interactions (DTIs) is an essential step in drug discovery and repositioning. DTI prediction based on biological experiments is time-consuming and expensive. In recent years, graph learning-based methods have aroused widespread interest and shown certain advantages on this task, where the DTI prediction is often modeled as a binary classification problem of the nodes composed of drug and protein pairs (DPPs). Nevertheless, in many real applications, labeled data are very limited and expensive to obtain. With only a few thousand labeled data, models could hardly recognize comprehensive patterns of DPP node representations, and are unable to capture enough commonsense knowledge, which is required in DTI prediction. Supervised contrastive learning gives an aligned representation of DPP node representations with the same class label. In embedding space, DPP node representations with the same label are pulled together, and those with different labels are pushed apart.
RESULTS: We propose an end-to-end supervised graph co-contrastive learning model for DTI prediction directly from heterogeneous networks. By contrasting the topology structures and semantic features of the drug-protein-pair network, as well as the new selection strategy of positive and negative samples, SGCL-DTI generates a contrastive loss to guide the model optimization in a supervised manner. Comprehensive experiments on three public datasets demonstrate that our model outperforms the SOTA methods significantly on the task of DTI prediction, especially in the case of cold start. Furthermore, SGCL-DTI provides a new research perspective of contrastive learning for DTI prediction.
AVAILABILITY AND IMPLEMENTATION: The research shows that this method has certain applicability in the discovery of drugs, the identification of drug-target pairs and so on.

MeSH Term

Drug Development
Drug Discovery
Drug Interactions
Machine Learning
Proteins

Chemicals

Proteins

Word Cloud

Created with Highcharts 10.0.0predictionDTIlearninggraphDPPnoderepresentationscontrastivemodeldrugdiscoveryexperimentsexpensivemethodscertaintaskpairslabeleddataSupervisedlabelsupervisedco-contrastivenewSGCL-DTIresearchdrug-targetMOTIVATION:IdentificationDrug-TargetInteractionsDTIsessentialsteprepositioningbasedbiologicaltime-consumingrecentyearslearning-basedarousedwidespreadinterestshownadvantagesoftenmodeledbinaryclassificationproblemnodescomposedproteinDPPsNeverthelessmanyrealapplicationslimitedobtainthousandmodelshardlyrecognizecomprehensivepatternsunablecaptureenoughcommonsenseknowledgerequiredgivesalignedrepresentationclassembeddingspacepulledtogetherdifferentlabelspushedapartRESULTS:proposeend-to-enddirectlyheterogeneousnetworkscontrastingtopologystructuressemanticfeaturesdrug-protein-pairnetworkwellselectionstrategypositivenegativesamplesgenerateslossguideoptimizationmannerComprehensivethreepublicdatasetsdemonstrateoutperformsSOTAsignificantlyespeciallycasecoldstartFurthermoreprovidesperspectiveAVAILABILITYANDIMPLEMENTATION:showsmethodapplicabilitydrugsidentificationoninteraction

Similar Articles

Cited By (22)