A Heterogeneous Ensemble Forecasting Model for Disease Prediction.

Nonita Sharma, Jaiditya Dev, Monika Mangla, Vaishali Mehta Wadhwa, Sachi Nandan Mohanty, Deepti Kakkar
Author Information
  1. Nonita Sharma: Dr. B. R. Ambedkar, National Institute of Technology Jalandhar, Jalandhar, Punjab India. ORCID
  2. Jaiditya Dev: Mayoor School, Noida, Uttar Pradesh India.
  3. Monika Mangla: Lokmanya Tilak College of Engineering, Navi Mumbai, Maharashtra India.
  4. Vaishali Mehta Wadhwa: Panipat Institute of Engineering and Technology, Panipat, Haryana India.
  5. Sachi Nandan Mohanty: IcfaiTech, ICFAI Foundation for Higher Education, Hyderabad, Telangana India.
  6. Deepti Kakkar: Dr. B. R. Ambedkar, National Institute of Technology Jalandhar, Jalandhar, Punjab India.

Abstract

The manuscript presents a bragging-based ensemble forecasting model for predicting the number of incidences of a disease based on past occurrences. The objectives of this research work are to enhance accuracy, reduce overfitting, and handle overdrift; the proposed model has shown promising results in terms of error metrics. The collated dataset of the diseases is collected from the official government site of Hong Kong from the year 2010 to 2019. The preprocessing is done using log transformation and score transformation. The proposed ensemble model is applied, and its applicability to a specific disease dataset is presented. The proposed ensemble model is compared against the ensemble models, namely dynamic ensemble for time series, arbitrated dynamic ensemble, and random forest using different error metrics. The proposed model shows the reduced value of MAE (mean average error) by 27.18%, 3.07%, 11.58%, 13.46% for tuberculosis, dengue, food poisoning, and chickenpox, respectively. The comparison drawn between the proposed model and the existing models shows that the proposed ensemble model gives better accuracy in the case of all the four-disease datasets.

Keywords

Word Cloud

Created with Highcharts 10.0.0ensemblemodelproposedforecastingerrordiseaseaccuracymetricsdatasetusingtransformationmodelsdynamicseriesshowsEnsembleDiseasemanuscriptpresentsbragging-basedpredictingnumberincidencesbasedpastoccurrencesobjectivesresearchworkenhancereduceoverfittinghandleoverdriftshownpromisingresultstermscollateddiseasescollectedofficialgovernmentsiteHongKongyear20102019preprocessingdonelogscoreappliedapplicabilityspecificpresentedcomparednamelytimearbitratedrandomforestdifferentreducedvalueMAEmeanaverage2718%307%1158%1346%tuberculosisdenguefoodpoisoningchickenpoxrespectivelycomparisondrawnexistinggivesbettercasefour-diseasedatasetsHeterogeneousForecastingModelPredictionBootstrappingBraggingTime

Similar Articles

Cited By