Transfer learning-based hybrid VGG16-machine learning approach for heart disease detection with explainable artificial intelligence.

Eshetie Gizachew Addisu, Tahayu Gizachew Yirga, Hailu Gizachew Yirga, Alemu Demeke Yehuala
Author Information
  1. Eshetie Gizachew Addisu: Department of Information Systems, College of Informatics, University of Gondar, Gondar, Ethiopia.
  2. Tahayu Gizachew Yirga: Department of Computer Science, College of Natural and Computational Science, Mekdela Amba University, Tulu Awuliya, Ethiopia.
  3. Hailu Gizachew Yirga: Department of Computer Science, College of Informatics, University of Gondar, Gondar, Ethiopia.
  4. Alemu Demeke Yehuala: Department of Surgery, College of Medicine and Health Science, University of Gondar, Gondar, Ethiopia.

Abstract

Heart disease is a leading cause of mortality worldwide, so accurate early detection is essential for effective treatment and management. This study introduces a novel hybrid approach to heart disease detection that combines transfer learning with the VGG16 convolutional neural network (CNN) and conventional machine-learning classifiers. A conditional tabular generative adversarial network (CTGAN) was used to generate synthetic samples from real datasets; the quality of the synthetic data was checked with statistical metrics, correlation analysis, and domain-expert assessment. The dataset is tabular, with 13 features per record. Each record is reshaped into an image-like format and resized to 224 × 224 × 3 to match the input requirements of VGG16. VGG16 then serves as a feature extractor, and the extracted features are fused with the original tabular features. The combined feature set is used to train several classifiers: Support Vector Machines (SVM), Gradient Boosting, Random Forest, Logistic Regression, K-Nearest Neighbors (KNN), and Decision Trees. Among these, the VGG16-Random Forest hybrid achieved 92% accuracy, 91.3% precision, 92.2% recall (sensitivity), 91.82% specificity, and a 91.75% F1-score. The hybrid models were also evaluated on unseen datasets to assess generalizability, where the VGG16-Random Forest combination showed relatively promising results. Explainability is provided through SHAP values, which indicate how much each feature contributes to the model's predictions. The hybrid VGG16-ML approach shows potential for accurate and interpretable heart disease detection that can support clinical decision-making.
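
The reshaping and fusion steps described in the abstract can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation. The abstract does not say how the 13 features are laid out as an image, so the sketch assumes zero-padding to a 4 × 4 grid, upsampling by pixel repetition, and copying the grid into three channels. The function names `tabular_to_image` and `fuse` are hypothetical. The VGG16 forward pass is left out: `fuse` simply takes an array of already extracted deep features.

```python
import numpy as np


def tabular_to_image(row, side=4, size=224):
    """Map one 1-D feature vector to a (size, size, 3) array.

    Assumed layout (not taken from the paper): zero-pad the vector to
    side*side values, reshape it to a square grid, upsample by pixel
    repetition, and copy the grid into three identical channels.
    """
    row = np.asarray(row, dtype=np.float32)
    if row.size > side * side:
        raise ValueError("feature vector does not fit in the grid")
    if size % side != 0:
        raise ValueError("size must be a multiple of side")
    padded = np.zeros(side * side, dtype=np.float32)
    padded[: row.size] = row
    grid = padded.reshape(side, side)
    scale = size // side
    # Repeat each cell `scale` times along both axes (nearest-neighbour upsampling).
    upsampled = np.repeat(np.repeat(grid, scale, axis=0), scale, axis=1)
    return np.stack([upsampled] * 3, axis=-1)


def fuse(deep_features, tabular):
    """Concatenate per-sample deep features with the original tabular columns."""
    deep_features = np.asarray(deep_features, dtype=np.float32)
    tabular = np.asarray(tabular, dtype=np.float32)
    if deep_features.shape[0] != tabular.shape[0]:
        raise ValueError("both inputs must have one row per sample")
    return np.concatenate([deep_features, tabular], axis=1)
```

In a full pipeline, each image would pass through a pretrained VGG16 before fusion, and the fused matrix would be the training input for a classifier such as Random Forest. The width of the deep feature vector depends on which VGG16 layer is used, which the abstract does not specify.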
