Combining multiple imputation and meta-analysis with individual participant data

Stat Med. 2013 Nov 20;32(26):4499-514. doi: 10.1002/sim.5844. Epub 2013 May 24.

Abstract

Multiple imputation is a strategy for the analysis of incomplete data such that the impact of the missingness on the power and bias of estimates is mitigated. When data from multiple studies are collated, we can propose both within-study and multilevel imputation models to impute missing data on covariates. It is not clear how to choose between imputation models or how to combine imputation and inverse-variance weighted meta-analysis methods. This is especially important as often different studies measure data on different variables, meaning that we may need to impute data on a variable which is systematically missing in a particular study. In this paper, we consider a simulation analysis of sporadically missing data in a single covariate with a linear analysis model and discuss how the results would be applicable to the case of systematically missing data. We find in this context that ensuring the congeniality of the imputation and analysis models is important to give correct standard errors and confidence intervals. For example, if the analysis model allows between-study heterogeneity of a parameter, then we should incorporate this heterogeneity into the imputation model to maintain the congeniality of the two models. In an inverse-variance weighted meta-analysis, we should impute missing data and apply Rubin's rules at the study level prior to meta-analysis, rather than meta-analyzing each of the multiple imputations and then combining the meta-analysis estimates using Rubin's rules. We illustrate the results using data from the Emerging Risk Factors Collaboration.

Keywords: Rubin's rules; individual participant data; meta-analysis; missing data; multiple imputation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Blood Pressure / physiology
  • Cholesterol, LDL / blood
  • Computer Simulation
  • Confidence Intervals*
  • Humans
  • Meta-Analysis as Topic*
  • Models, Statistical*

Substances

  • Cholesterol, LDL