Short-Range Forecasting of COVID-19 During Early Onset at County, Health District, and State Geographic Levels Using Seven Methods: Comparative Forecasting Study.

Christopher J Lynch, Ross Gore
Author Information
  1. Christopher J Lynch: Virginia Modeling, Analysis, and Simulation Center, Old Dominion University, Suffolk, VA, United States. ORCID
  2. Ross Gore: Virginia Modeling, Analysis, and Simulation Center, Old Dominion University, Suffolk, VA, United States. ORCID

Abstract

BACKGROUND: Forecasting methods rely on trends and averages of prior observations to forecast COVID-19 case counts. COVID-19 forecasts have received much media attention, and numerous platforms have been created to inform the public. However, forecasting effectiveness varies by geographic scope and is affected by changing assumptions in behaviors and preventative measures in response to the pandemic. Due to time requirements for developing a COVID-19 vaccine, evidence is needed to inform short-term forecasting method selection at county, health district, and state levels.
OBJECTIVE: COVID-19 forecasts keep the public informed and contribute to public policy. As such, proper understanding of forecasting purposes and outcomes is needed to advance knowledge of health statistics for policy makers and the public. Using publicly available real-time data provided online, we aimed to evaluate the performance of seven forecasting methods utilized to forecast cumulative COVID-19 case counts. Forecasts were evaluated based on how well they forecast 1, 3, and 7 days forward when utilizing 1-, 3-, 7-, or all prior-day cumulative case counts during early virus onset. This study provides an objective evaluation of the forecasting methods to identify forecasting model assumptions that contribute to lower error in forecasting COVID-19 cumulative case growth. This information benefits professionals, decision makers, and the public relying on the data provided by short-term case count estimates at varied geographic levels.
METHODS: We created 1-, 3-, and 7-day forecasts at the county, health district, and state levels using (1) a naïve approach, (2) Holt-Winters (HW) exponential smoothing, (3) a growth rate approach, (4) a moving average (MA) approach, (5) an autoregressive (AR) approach, (6) an autoregressive moving average (ARMA) approach, and (7) an autoregressive integrated moving average (ARIMA) approach. Forecasts relied on Virginia's 3464 historical county-level cumulative case counts from March 7 to April 22, 2020, as reported by The New York Times. Statistically significant results were identified using 95% CIs of median absolute error (MdAE) and median absolute percentage error (MdAPE) metrics of the resulting 216,698 forecasts.
RESULTS: The next-day MA forecast with 3-day look-back length obtained the lowest MdAE (median 0.67, 95% CI 0.49-0.84, P<.001) and statistically significantly differed from 39 out of 59 alternatives (66%) to 53 out of 59 alternatives (90%) at each geographic level at a significance level of .01. For short-range forecasting, methods assuming stationary means of prior days' counts outperformed methods with assumptions of weak stationarity or nonstationarity means. MdAPE results revealed statistically significant differences across geographic levels.
CONCLUSIONS: For short-range COVID-19 cumulative case count forecasting at the county, health district, and state levels during early onset, the following were found: (1) the MA method was effective for forecasting 1-, 3-, and 7-day cumulative case counts; (2) exponential growth was not the best representation of case growth during early virus onset when the public was aware of the virus; and (3) geographic resolution was a factor in the selection of forecasting methods.

Keywords

References

  1. JAMA. 2020 Apr 7;323(13):1239-1242 [PMID: 32091533]
  2. PLoS Negl Trop Dis. 2016 Jun 15;10(6):e0004761 [PMID: 27304062]
  3. J Med Virol. 2020 Jun;92(6):645-659 [PMID: 32141624]
  4. Stat Med. 2009 Jan 30;28(2):205-20 [PMID: 18991332]
  5. PLoS One. 2020 Mar 31;15(3):e0230405 [PMID: 32231374]
  6. PLoS One. 2020 Oct 13;15(10):e0240648 [PMID: 33048967]
  7. JMIR Public Health Surveill. 2020 Jun 19;6(2):e19353 [PMID: 32427104]
  8. MMWR Morb Mortal Wkly Rep. 2020 Aug 21;69(33):1127-1132 [PMID: 32817606]
  9. Data Brief. 2021 Apr;35:106759 [PMID: 33521186]
  10. Infect Dis Model. 2020 Feb 14;5:256-263 [PMID: 32110742]
  11. IEEE Access. 2020 Oct 08;8:186932-186938 [PMID: 34812360]
  12. Lancet Infect Dis. 2021 Feb;21(2):203-212 [PMID: 33091374]
  13. N Engl J Med. 2020 Mar 26;382(13):1199-1207 [PMID: 31995857]
  14. PLoS One. 2020 May 13;15(5):e0232929 [PMID: 32401795]
  15. Science. 2020 May 15;368(6492):713-714 [PMID: 32332062]
  16. Biosecur Bioterror. 2011 Jun;9(2):89-115 [PMID: 21612363]
  17. Psychol Sci Public Interest. 2007 Nov;8(2):53-96 [PMID: 26161749]
  18. PLoS One. 2020 Oct 14;15(10):e0240348 [PMID: 33052956]
  19. J Med Internet Res. 2020 Aug 17;22(8):e20285 [PMID: 32730217]
  20. Nat Methods. 2014 Feb;11(2):119-20 [PMID: 24645192]
  21. Am Psychol. 2005 Feb-Mar;60(2):170-80 [PMID: 15740449]
  22. Science. 2015 Mar 13;347(6227):aaa4339 [PMID: 25766240]
  23. J Appl Stat. 2021 May 23;50(11-12):2408-2434 [PMID: 37529572]
  24. J Infect Dis. 2020 May 11;221(11):1757-1761 [PMID: 32067043]
  25. PLoS Med. 2020 Aug 11;17(8):e1003244 [PMID: 32780772]
  26. Lancet Infect Dis. 2020 Jun;20(6):669-677 [PMID: 32240634]
  27. Proc Natl Acad Sci U S A. 2020 Jul 14;117(28):16264-16266 [PMID: 32581118]
  28. J Med Internet Res. 2020 Oct 26;22(10):e21632 [PMID: 33027035]
  29. PLoS One. 2020 Jul 28;15(7):e0236238 [PMID: 32722716]
  30. Lancet Infect Dis. 2020 Jul;20(7):793-802 [PMID: 32247326]
  31. BMC Med. 2012 Dec 13;10:165 [PMID: 23237460]
  32. Lancet Infect Dis. 2020 May;20(5):533-534 [PMID: 32087114]
  33. MMWR Morb Mortal Wkly Rep. 2020 Oct 09;69(40):1460-1463 [PMID: 33031366]
  34. PLoS One. 2020 Oct 20;15(10):e0240578 [PMID: 33079964]
  35. BMJ. 2011;343:d2304 [PMID: 22803193]
  36. MMWR Morb Mortal Wkly Rep. 2020 Sep 04;69(35):1198-1203 [PMID: 32881851]
  37. PM R. 2016 Mar;8(3):268-72 [PMID: 26892802]
  38. Proc Natl Acad Sci U S A. 2009 Mar 3;106(9):3243-8 [PMID: 19204283]
  39. Lancet Infect Dis. 2021 Feb;21(2):193-202 [PMID: 33729915]
  40. Health Aff (Millwood). 2020 Sep;39(9):1615-1623 [PMID: 32644825]
  41. JMIR Public Health Surveill. 2020 May 13;6(2):e19115 [PMID: 32391801]
  42. JMIR Public Health Surveill. 2020 Apr 14;6(2):e18828 [PMID: 32234709]
  43. PLoS One. 2020 Mar 31;15(3):e0231236 [PMID: 32231392]
  44. Lancet Infect Dis. 2020 May;20(5):545 [PMID: 32311327]
  45. Science. 2020 Apr 24;368(6489):395-400 [PMID: 32144116]

MeSH Term

COVID-19
Communicable Disease Control
Disease Transmission, Infectious
Early Diagnosis
Forecasting
Humans
Local Government
Pandemics
Residence Characteristics
SARS-CoV-2
State Health Plans
Virginia

Word Cloud

Created with Highcharts 10.0.0forecastingCOVID-19casepublicmethodscountscumulativeapproachgeographichealthlevelsforecastforecastsgrowthForecastingassumptionscountydistrictstate1371-3-earlyvirusonseterrormovingaverageMAautoregressivemediandiseasepriorcreatedinformneededshort-termmethodselectioncontributepolicymakersUsingdataprovidedForecastscount7-dayusing2exponentialsignificantresults95%absoluteMdAEMdAPE0statistically59alternativeslevelshort-rangemeansmodelingBACKGROUND:relytrendsaveragesobservationsreceivedmuchmediaattentionnumerousplatformsHowevereffectivenessvariesscopeaffectedchangingbehaviorspreventativemeasuresresponsepandemicDuetimerequirementsdevelopingvaccineevidenceOBJECTIVE:keepinformedproperunderstandingpurposesoutcomesadvanceknowledgestatisticspubliclyavailablereal-timeonlineaimedevaluateperformancesevenutilizedevaluatedbasedwelldaysforwardutilizing7-prior-daystudyprovidesobjectiveevaluationidentifymodellowerinformationbenefitsprofessionalsdecisionrelyingestimatesvariedMETHODS:naïveHolt-WintersHWsmoothingrate45AR6ARMAintegratedARIMAreliedVirginia's3464historicalcounty-levelMarchApril222020reportedNewYorkTimesStatisticallyidentifiedCIspercentagemetricsresulting216698RESULTS:next-day3-daylook-backlengthobtainedlowest67CI49-084P<001significantlydiffered3966%5390%significance01assumingstationarydays'outperformedweakstationaritynonstationarityrevealeddifferencesacrossCONCLUSIONS:followingfound:effectivebestrepresentationawareresolutionfactorShort-RangeEarlyOnsetCountyHealthDistrictStateGeographicLevelsSevenMethods:ComparativeStudycoronavirus2019emergingoutbreakinfectioussimulationoutbreaks

Similar Articles

Cited By