Multiple imputation for multivariate data with missing and below-threshold measurements: time-series concentrations of pollutants in the Arctic.

P K Hopke, C Liu, D B Rubin
Author Information
  1. P K Hopke: Department of Chemistry, Clarkson University, Potsdam, New York 13699, USA.

Abstract

Many chemical and environmental data sets are complicated by the existence of fully missing values or censored values known to lie below detection thresholds. For example, week-long samples of airborne particulate matter were obtained at Alert, NWT, Canada, between 1980 and 1991, where some of the concentrations of 24 particulate constituents were coarsened in the sense of being either fully missing or below detection limits. To facilitate scientific analysis, it is appealing to create complete data by filling in missing values so that standard complete-data methods can be applied. We briefly review commonly used strategies for handling missing values and focus on the multiple-imputation approach, which generally leads to valid inferences when faced with missing data. Three statistical models are developed for multiply imputing the missing values of airborne particulate matter. We expect that these models are useful for creating multiple imputations in a variety of incomplete multivariate time series data sets.

MeSH Term

Air Pollutants
Arctic Regions
Biometry
Data Interpretation, Statistical
Models, Statistical
Multivariate Analysis
Northwest Territories
Time Factors

Chemicals

Air Pollutants

Word Cloud

Created with Highcharts 10.0.0missingdatavaluesparticulatesetsfullydetectionairbornematterconcentrationsmodelsmultivariateManychemicalenvironmentalcomplicatedexistencecensoredknownliethresholdsexampleweek-longsamplesobtainedAlertNWTCanada1980199124constituentscoarsenedsenseeitherlimitsfacilitatescientificanalysisappealingcreatecompletefillingstandardcomplete-datamethodscanappliedbrieflyreviewcommonlyusedstrategieshandlingfocusmultiple-imputationapproachgenerallyleadsvalidinferencesfacedThreestatisticaldevelopedmultiplyimputingexpectusefulcreatingmultipleimputationsvarietyincompletetimeseriesMultipleimputationbelow-thresholdmeasurements:time-seriespollutantsArctic

Similar Articles

Cited By (27)