Machine learning based predictors for COVID-19 disease severity.

Dhruv Patel, Vikram Kher, Bhushan Desai, Xiaomeng Lei, Steven Cen, Neha Nanda, Ali Gholamrezanezhad, Vinay Duddalwar, Bino Varghese, Assad A Oberai
Author Information
  1. Dhruv Patel: Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA.
  2. Vikram Kher: Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA.
  3. Bhushan Desai: Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
  4. Xiaomeng Lei: Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
  5. Steven Cen: Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
  6. Neha Nanda: Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
  7. Ali Gholamrezanezhad: Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
  8. Vinay Duddalwar: Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
  9. Bino Varghese: Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
  10. Assad A Oberai: Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA. aoberai@usc.edu.

Abstract

Predictors of the need for intensive care and mechanical ventilation can help healthcare systems in planning for surge capacity for COVID-19. We used socio-demographic data, clinical data, and blood panel profile data at the time of initial presentation to develop machine learning algorithms for predicting the need for intensive care and mechanical ventilation. Among the algorithms considered, the Random Forest classifier performed the best with [Formula: see text] for predicting ICU need and [Formula: see text] for predicting the need for mechanical ventilation. We also determined the most influential features in making this prediction, and concluded that all three categories of data are important. We determined the relative importance of blood panel profile data and noted that the AUC dropped by 0.12 units when this data was not included, thus indicating that it provided valuable information in predicting disease severity. Finally, we generated RF predictors with a reduced set of five features that retained the performance of the predictors trained on all features. These predictors, which rely only on quantitative data, are less prone to errors and subjectivity.

References

  1. J Biomed Inform. 2009 Apr;42(2):377-81 [PMID: 18929686]
  2. Clin Microbiol Infect. 2020 Jun;26(6):767-772 [PMID: 32304745]
  3. J Infect. 2020 Jul;81(1):e95-e97 [PMID: 32305490]
  4. JAMA Intern Med. 2020 Aug 1;180(8):1081-1089 [PMID: 32396163]
  5. Invest Radiol. 2020 Jun;55(6):327-331 [PMID: 32118615]
  6. Nat Commun. 2020 Jul 15;11(1):3543 [PMID: 32669540]
  7. J Clin Med. 2020 Jun 01;9(6): [PMID: 32492874]
  8. Clin Chim Acta. 2020 Jun;505:190-191 [PMID: 32145275]
  9. J Infect. 2020 Aug;81(2):205-212 [PMID: 32579986]
  10. Comput Math Methods Med. 2017;2017:3762651 [PMID: 28642804]
  11. Mayo Clin Proc. 2020 Jun;95(6):1138-1147 [PMID: 32376101]
  12. Aging Dis. 2020 Jul 23;11(4):763-769 [PMID: 32765943]
  13. J Clin Virol. 2020 Jun;127:104370 [PMID: 32344321]
  14. Radiology. 2021 Jan;298(1):E46-E54 [PMID: 32787701]

MeSH Term

Adolescent
Adult
Aged
Aged, 80 and over
COVID-19
Cohort Studies
Female
Humans
Intensive Care Units
Machine Learning
Male
Middle Aged
Prognosis
Risk Factors
SARS-CoV-2
Severity of Illness Index
Young Adult

Word Cloud

Created with Highcharts 10.0.0dataneedpredictingpredictorsmechanicalventilationfeaturesintensivecareCOVID-19bloodpanelprofilelearningalgorithms[Formula:seetext]determineddiseaseseverityPredictorscanhelphealthcaresystemsplanningsurgecapacityusedsocio-demographicclinicaltimeinitialpresentationdevelopmachineAmongconsideredRandomForestclassifierperformedbestICUalsoinfluentialmakingpredictionconcludedthreecategoriesimportantrelativeimportancenotedAUCdropped012unitsincludedthusindicatingprovidedvaluableinformation inFinallygeneratedRFreducedsetfiveretainedperformancetrainedrelyquantitativelessproneerrorssubjectivityMachinebased

Similar Articles

Cited By