Disease surveillance using a hidden Markov model.

Rochelle E Watkins, Serryn Eagleson, Bert Veenendaal, Graeme Wright, Aileen J Plant
Author Information
  1. Rochelle E Watkins: Curtin Health Innovation Research Institute, Curtin University of Technology, Perth, Australia. Rochelle.Watkins@curtin.edu.au

Abstract

BACKGROUND: Routine surveillance of disease notification data can enable the early detection of localised disease outbreaks. Although hidden Markov models (HMMs) have been recognised as an appropriate method to model disease surveillance data, they have been rarely applied in public health practice. We aimed to develop and evaluate a simple flexible HMM for disease surveillance which is suitable for use with sparse small area count data and requires little baseline data.
METHODS: A Bayesian HMM was designed to monitor routinely collected notifiable disease data that are aggregated by residential postcode. Semi-synthetic data were used to evaluate the algorithm and compare outbreak detection performance with the established Early Aberration Reporting System (EARS) algorithms and a negative binomial cusum.
RESULTS: Algorithm performance varied according to the desired false alarm rate for surveillance. At false alarm rates around 0.05, the cusum-based algorithms provided the best overall outbreak detection performance, having similar sensitivity to the HMMs and a shorter average time to detection. At false alarm rates around 0.01, the HMM algorithms provided the best overall outbreak detection performance, having higher sensitivity than the cusum-based Methods and a generally shorter time to detection for larger outbreaks. Overall, the 14-day HMM had a significantly greater area under the receiver operator characteristic curve than the EARS C3 and 7-day negative binomial cusum algorithms.
CONCLUSION: Our findings suggest that the HMM provides an effective method for the surveillance of sparse small area notifiable disease data at low false alarm rates. Further investigations are required to evaluation algorithm performance across other diseases and surveillance contexts.

References

  1. Public Health Rep. 1963 Jun;78(6):494-506 [PMID: 19316455]
  2. BMC Med Inform Decis Mak. 2007 Feb 14;7:4 [PMID: 17300714]
  3. BMC Med Inform Decis Mak. 2008 Aug 13;8:37 [PMID: 18700044]
  4. Clin Infect Dis. 2006 Oct 15;43(8):1016-22 [PMID: 16983614]
  5. Stat Med. 2005 Feb 28;24(4):543-50 [PMID: 15678442]
  6. Stat Methods Med Res. 2006 Aug;15(4):337-52 [PMID: 16886735]
  7. BMC Med Inform Decis Mak. 2007 Mar 01;7:6 [PMID: 17331250]
  8. J Am Med Inform Assoc. 2008 Jan-Feb;15(1):77-86 [PMID: 17947614]
  9. J Public Health Manag Pract. 2005 Jul-Aug;11(4):274-81 [PMID: 15958924]
  10. Stat Med. 1999 Dec 30;18(24):3463-78 [PMID: 10611619]
  11. Stat Methods Med Res. 2006 Oct;15(5):445-64 [PMID: 17089948]
  12. BMC Med Inform Decis Mak. 2007 Jun 13;7:15 [PMID: 17567912]
  13. PLoS Med. 2005 Mar;2(3):e59 [PMID: 15719066]
  14. Stat Med. 2008 Jul 30;27(17):3407-29 [PMID: 18240128]

MeSH Term

Algorithms
Hepatitis A
Markov Chains
Population Surveillance
ROC Curve
Western Australia

Word Cloud

Created with Highcharts 10.0.0surveillancedatadiseasedetectionHMMperformancealgorithmsfalsealarmareaoutbreakratesoutbreakshiddenMarkovHMMsmethodmodelevaluatesparsesmallnotifiablealgorithmEARSnegativebinomialcusumaround0cusum-basedprovidedbestoverallsensitivityshortertimeBACKGROUND:RoutinenotificationcanenableearlylocalisedAlthoughmodelsrecognisedappropriaterarelyappliedpublichealthpracticeaimeddevelopsimpleflexiblesuitableusecountrequireslittlebaselineMETHODS:BayesiandesignedmonitorroutinelycollectedaggregatedresidentialpostcodeSemi-syntheticusedcompareestablishedEarlyAberrationReportingSystemRESULTS:Algorithmvariedaccordingdesiredrate05similaraverage01higherMethodsgenerallylargerOverall14-daysignificantlygreaterreceiveroperatorcharacteristiccurveC37-dayCONCLUSION:findingssuggestprovideseffectivelowinvestigationsrequiredevaluationacrossdiseasescontextsDiseaseusing

Similar Articles

Cited By