BBB-PEP-prediction: improved computational model for identification of blood-brain barrier peptides using blending position relative composition specific features and ensemble modeling.

Ansar Naseem, Fahad Alturise, Tamim Alkhalifah, Yaser Daanial Khan
Author Information
  1. Ansar Naseem: Department of Artificial Intelligence, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan.
  2. Fahad Alturise: Department of Computer, College of Science and Arts in Ar Rass, Qassim University, Ar Rass, Saudi Arabia. falturise@qu.edu.sa.
  3. Tamim Alkhalifah: Department of Computer, College of Science and Arts in Ar Rass, Qassim University, Ar Rass, Saudi Arabia.
  4. Yaser Daanial Khan: Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Pakistan.

Abstract

BBPs have the potential to facilitate the delivery of drugs to the brain, opening up new avenues for the development of treatments targeting diseases of the central nervous system (CNS). The obstacle faced in central nervous system disorders stems from the formidable task of traversing the blood-brain barrier (BBB) for pharmaceutical agents. Nearly 98% of small molecule-based drugs and nearly 100% of large molecule-based drugs encounter difficulties in successfully penetrating the BBB. This importance leads to identification of these peptides, can help in healthcare systems. In this study, we proposed an improved intelligent computational model BBB-PEP-Prediction for identification of BBB peptides. Position and statistical moments based features have been computed for acquired benchmark dataset. Four types of ensembles such as bagging, boosting, stacking and blending have been utilized in the methodology section. Bagging employed Random Forest (RF) and Extra Trees (ET), Boosting utilizes XGBoost (XGB) and Light Gradient Boosting Machine (LGBM). Stacking uses ET and XGB as base learners, blending exploited LGBM and RF as base learners, while Logistic Regression (LR) has been applied as Meta learner for stacking and blending. Three classifiers such as LGBM, XGB and ET have been optimized by using Randomized search CV. Four types of testing such as self-consistency, independent set, cross-validation with 5 and 10 folds and jackknife test have been employed. Evaluation metrics such as Accuracy (ACC), Specificity (SPE), Sensitivity (SEN), Mathew's correlation coefficient (MCC) have been utilized. The stacking of classifiers has shown best results in almost each testing. The stacking results for independent set testing exhibits accuracy, specificity, sensitivity and MCC score of 0.824, 0.911, 0.831 and 0.663 respectively. The proposed model BBB-PEP-Prediction shown superlative performance as compared to previous benchmark studies. The proposed system helps in future research and research community for in-silico identification of BBB peptides.

Keywords

References

  1. IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):2045-2056 [PMID: 31985438]
  2. Brain Struct Funct. 2021 Nov;226(8):2489-2495 [PMID: 34269889]
  3. Neurobiol Dis. 2010 Jan;37(1):13-25 [PMID: 19664713]
  4. Science. 1993 Jan 15;259(5093):373-7 [PMID: 8420006]
  5. Adv Drug Deliv Rev. 2012 May 15;64(7):589 [PMID: 22388004]
  6. Oncotarget. 2017 Jul 11;8(28):46635-46651 [PMID: 28422728]
  7. Curr Genomics. 2019 Feb;20(2):124-133 [PMID: 31555063]
  8. Anal Biochem. 2021 Nov 15;633:114385 [PMID: 34571005]
  9. Pharmaceutics. 2021 Aug 11;13(8): [PMID: 34452198]
  10. Sci Rep. 2021 Nov 5;11(1):21767 [PMID: 34741132]
  11. Front Genet. 2022 May 17;13:845747 [PMID: 35656322]
  12. Nat Rev Neurol. 2018 Mar;14(3):133-150 [PMID: 29377008]
  13. Digit Health. 2023 Jul 5;9:20552076231180739 [PMID: 37434723]
  14. Diagnostics (Basel). 2023 Jun 01;13(11): [PMID: 37296792]
  15. IEEE/ACM Trans Comput Biol Bioinform. 2021 Mar-Apr;18(2):596-610 [PMID: 31144645]
  16. Appl Bionics Biomech. 2022 Apr 13;2022:5483115 [PMID: 35465187]
  17. Nat Rev Neurosci. 2006 Jan;7(1):41-53 [PMID: 16371949]
  18. Brain Struct Funct. 2012 Jul;217(3):687-718 [PMID: 22205159]
  19. Anal Biochem. 2021 Feb 15;615:114069 [PMID: 33340540]
  20. PeerJ. 2021 Aug 4;9:e11581 [PMID: 34430072]
  21. Physiol Rev. 2019 Jan 1;99(1):21-78 [PMID: 30280653]
  22. Mol Membr Biol. 2014 Aug;31(5):152-67 [PMID: 25046533]
  23. Curr Genomics. 2019 May;20(4):306-320 [PMID: 32030089]
  24. NeuroRx. 2005 Jan;2(1):3-14 [PMID: 15717053]
  25. J Chem Inf Model. 2021 Jan 25;61(1):525-534 [PMID: 33426873]
  26. PeerJ Comput Sci. 2023 May 23;9:e1353 [PMID: 37346628]
  27. Comput Intell Neurosci. 2023 Jan 25;2023:2465414 [PMID: 36744119]
  28. Sci Rep. 2021 Jun 10;11(1):12281 [PMID: 34112883]
  29. Comb Chem High Throughput Screen. 2020;23(8):797-804 [PMID: 32342804]
  30. Anal Biochem. 2020 Jan 1;588:113477 [PMID: 31654612]

Word Cloud

Created with Highcharts 10.0.0BBBidentificationpeptidesstackingblending0drugssystemproposedmodelETXGBLGBMtestingcentralnervousblood-brainbarriermolecule-basedimprovedcomputationalBBB-PEP-PredictionfeaturesbenchmarkFourtypesutilizedemployedRFBoostingMachinebaselearnersclassifiersusingindependentsetMCCshownresultsresearchLearningBBPspotentialfacilitatedeliverybrainopeningnewavenuesdevelopmenttreatmentstargetingdiseasesCNSobstaclefaceddisordersstemsformidabletasktraversingpharmaceuticalagentsNearly98%smallnearly100%largeencounterdifficultiessuccessfullypenetratingimportanceleadscanhelphealthcaresystemsstudyintelligentPositionstatisticalmomentsbasedcomputedacquireddatasetensemblesbaggingboostingmethodologysectionBaggingRandomForestExtraTreesutilizesXGBoostLightGradientStackingusesexploitedLogisticRegressionLRappliedMetalearnerThreeoptimizedRandomizedsearchCVself-consistencycross-validation510foldsjackknifetestEvaluationmetricsAccuracyACCSpecificitySPESensitivitySENMathew'scorrelationcoefficientbestalmostexhibitsaccuracyspecificitysensitivityscore824911831663respectivelysuperlativeperformancecomparedpreviousstudieshelpsfuturecommunityin-silicoBBB-PEP-prediction:positionrelativecompositionspecificensemblemodelingDataMiningEnsembleModelingPatternRecognitionPeptideClassificationSequenceAnalysisSupervisedTransferLearningArtificialintelligence

Similar Articles

Cited By (2)