Handling multivariable missing data in causal mediation analysis estimating interventional effects.

S Ghazaleh Dashti, Katherine J Lee, Julie A Simpson, John B Carlin, Margarita Moreno-Betancur
Author Information
  1. S Ghazaleh Dashti: Clinical Epidemiology and Biostatistics Unit, Murdoch Children's Research Institute, Melbourne, Victoria, Australia.
  2. Katherine J Lee: Clinical Epidemiology and Biostatistics Unit, Murdoch Children's Research Institute, Melbourne, Victoria, Australia.
  3. Julie A Simpson: Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, University of Melbourne, Melbourne, Victoria, Australia.
  4. John B Carlin: Clinical Epidemiology and Biostatistics Unit, Murdoch Children's Research Institute, Melbourne, Victoria, Australia.
  5. Margarita Moreno-Betancur: Clinical Epidemiology and Biostatistics Unit, Murdoch Children's Research Institute, Melbourne, Victoria, Australia.

Abstract

The interventional effects approach to causal mediation analysis is increasingly common in epidemiologic research given its potential to address policy-relevant questions about hypothetical mediator interventions. Multiple imputation is widely used for handling multivariable missing data in epidemiologic studies. However, guidance is lacking on best practice for using multiple imputation when estimating interventional mediation effects, specifically regarding the role of missingness mechanism in the performance of the method, how to appropriately specify the multiple imputation model when g-computation is used for effect estimation, and appropriate variance estimation. To address this gap, we conducted simulations based on the Victorian Adolescent Health Cohort Study. We considered seven missingness mechanisms, involving varying assumptions regarding the influence of an intermediate confounder, a mediator, and/or the outcome on missingness in key variables. We compared the performance of complete-case analysis, six multiple imputation approaches by fully conditional specification, differing in how the imputation model was tailored, and a "substantive model compatible" multiple imputation-fully conditional specification approach. We evaluated MIBoot (multiple imputation, then bootstrap) and BootMI (bootstrap, then multiple imputation) approaches for variance estimation. All multiple imputation approaches, apart from those clearly diverging from best practice, yielded approximately unbiased estimates when none of the intermediate confounder, mediator, and outcome variables influenced missingness in any of these variables and non-negligible bias otherwise. We observed the largest bias for interventional effects when each of the intermediate confounders, mediators, and outcomes influenced their own missingness. BootMI returned variance estimates with smaller bias than MIBoot.

Word Cloud

Created with Highcharts 10.0.0imputationmultiplemissingnessinterventionaleffectsmediationanalysismediatormodelestimationvarianceintermediatevariablesapproachesbiasapproachcausalepidemiologicaddressusedmultivariablemissingdatabestpracticeestimatingregardingperformanceconfounderoutcomeconditionalspecificationMIBootbootstrapBootMIestimatesinfluencedincreasinglycommonresearchgivenpotentialpolicy-relevantquestionshypotheticalinterventionsMultiplewidelyhandlingstudiesHoweverguidancelackingusingspecificallyrolemechanismmethodappropriatelyspecifyg-computationeffectappropriategapconductedsimulationsbasedVictorianAdolescentHealthCohortStudyconsideredsevenmechanismsinvolvingvaryingassumptionsinfluenceand/orkeycomparedcomplete-casesixfullydifferingtailored"substantivecompatible"imputation-fullyevaluatedapartclearlydivergingyieldedapproximatelyunbiasednonenon-negligibleotherwiseobservedlargestconfoundersmediatorsoutcomesreturnedsmallerHandling

Similar Articles

Cited By