NSGA-II as feature selection technique and AdaBoost classifier for COVID-19 prediction using patient's symptoms.

Makram Soui, Nesrine Mansouri, Raed Alhamad, Marouane Kessentini, Khaled Ghedira
Author Information
  1. Makram Soui: College of Computing and Informatics, Saudi Electronic University, Riyadh, Saudi Arabia.
  2. Nesrine Mansouri: University of Gabes, Gabes, Tunisia.
  3. Raed Alhamad: College of Computing and Informatics, Saudi Electronic University, Riyadh, Saudi Arabia.
  4. Marouane Kessentini: University of Michigan - Dearborn, Dearborn, MI USA.
  5. Khaled Ghedira: Private Higher School of Engineering and Technology, Ariana, Tunisia.

Abstract

Nowadays, humanity is facing one of the most dangerous pandemics known as COVID-19. Due to its high inter-person contagiousness, COVID-19 is rapidly spreading across the world. Positive patients are often suffering from different symptoms that can vary from mild to severe including cough, fever, sore throat, and body aches. In more dire cases, infected patients can experience severe symptoms that can cause breathing difficulties which lead to stern organ failure and die. The medical corps all over the world are overloaded because of the exponentially myriad number of contagions. Therefore, screening for the disease becomes overwrought with the limited tools of test. Additionally, test results may take a long time to acquire, leaving behind a higher potential for the prevalence of the virus among other individuals by the patients. To reduce the chances of infection, we suggest a prediction model that distinguishes the infected COVID-19 cases based on clinical symptoms and features. This model can be helpful for citizens to catch their infection without the need for visiting the hospital. Also, it helps the medical staff in triaging patients in case of a deficiency of medical amenities. In this paper, we use the non-dominated sorting genetic algorithm (NSGA-II) to select the interesting features by finding the best trade-offs between two conflicting objectives: minimizing the number of features and maximizing the weights of selected features. Then, a classification phase is conducted using an AdaBoost classifier. The proposed model is evaluated using two different datasets. To maximize results, we performed a natural selection of hyper-parameters of the classifier using the genetic algorithm. The obtained results prove the efficiency of NSGA-II as a feature selection algorithm combined with AdaBoost classifier. It exhibits higher classification results that outperformed the existing methods.

Keywords

References

  1. Sci Rep. 2021 May 24;11(1):10738 [PMID: 34031483]
  2. Int J Inf Technol. 2020;12(3):731-739 [PMID: 32838125]
  3. Earth Syst Environ. 2020;4(4):789-796 [PMID: 34723075]
  4. Earth Syst Environ. 2020;4(3):523-534 [PMID: 34723072]
  5. Inform Med Unlocked. 2020;20:100374 [PMID: 32835073]
  6. Iran J Public Health. 2020 Oct;49(Suppl 1):114-116 [PMID: 34268215]
  7. PLoS One. 2020 Jul 28;15(7):e0236238 [PMID: 32722716]
  8. PLoS One. 2020 Jun 26;15(6):e0235187 [PMID: 32589673]
  9. Sci Total Environ. 2020 Aug 10;729:139051 [PMID: 32361460]
  10. Appl Geogr. 2020 May;118:102202 [PMID: 32287518]
  11. Entropy (Basel). 2020 Aug 07;22(8): [PMID: 33286642]
  12. Lancet. 2020 Feb 15;395(10223):507-513 [PMID: 32007143]
  13. Comput Intell Neurosci. 2018 Jun 11;2018:3050214 [PMID: 29991942]
  14. BMC Bioinformatics. 2013 Mar 22;14:106 [PMID: 23522326]
  15. Clin Chem Lab Med. 2020 Jun 25;58(7):1070-1076 [PMID: 32172228]
  16. IEEE Trans Vis Comput Graph. 2018 Jan;24(1):163-173 [PMID: 28866545]
  17. Earth Syst Environ. 2020;4(4):797-811 [PMID: 34723076]
  18. J Med Syst. 2020 Jul 1;44(8):135 [PMID: 32607737]
  19. J Biomed Inform. 2002 Oct-Dec;35(5-6):352-9 [PMID: 12968784]
  20. Earth Syst Environ. 2020;4(3):513-522 [PMID: 34723073]
  21. Earth Syst Environ. 2021;5(1):87-100 [PMID: 34723077]
  22. NPJ Digit Med. 2021 Jan 4;4(1):3 [PMID: 33398013]
  23. Appl Intell (Dordr). 2023;53(4):4874 [PMID: 38624372]
  24. Sci Total Environ. 2020 Aug 15;730:139089 [PMID: 32387823]
  25. Nature. 2020 Mar;579(7798):265-269 [PMID: 32015508]

Word Cloud

Created with Highcharts 10.0.0COVID-19patientssymptomscanresultsfeaturesNSGA-IIusingAdaBoostclassifierselectionmedicalpredictionmodelalgorithmworlddifferentseverecasesinfectednumbertesthigherinfectiongenetictwoclassificationfeatureNowadayshumanityfacingonedangerouspandemicsknownDuehighinter-personcontagiousnessrapidlyspreadingacrossPositiveoftensufferingvarymildincludingcoughfeversorethroatbodyachesdireexperiencecausebreathingdifficultiesleadsternorganfailurediecorpsoverloadedexponentiallymyriadcontagionsThereforescreeningdiseasebecomesoverwroughtlimitedtoolsAdditionallymaytakelongtimeacquireleavingbehindpotentialprevalencevirusamongindividualsreducechancessuggestdistinguishesbasedclinicalhelpfulcitizenscatchwithoutneedvisitinghospitalAlsohelpsstafftriagingcasedeficiencyamenitiespaperusenon-dominatedsortingselectinterestingfindingbesttrade-offsconflictingobjectives:minimizingmaximizingweightsselectedphaseconductedproposedevaluateddatasetsmaximizeperformednaturalhyper-parametersobtainedproveefficiencycombinedexhibitsoutperformedexistingmethodstechniquepatient'sFeatureHyper-parametersoptimizationMachinelearning

Similar Articles

Cited By (8)