
doi: 10.1002/sim.9315
pmid: 35064581
AbstractMultiple imputation is a promising approach to handle missing data and is widely used in analysis of longitudinal clinical studies. A key consideration in the implementation of multiple imputation is to obtain accurate imputed values by specifying an imputation model that incorporates auxiliary variables potentially associated with missing variables. The use of informative auxiliary variables is known to be beneficial to make the missing at random assumption more plausible and help to reduce uncertainty of the imputations; however, it is not straightforward to pre‐specify them in many cases. We propose a data‐driven specification of the imputation model using Bayesian lasso in the context of longitudinal clinical study, and develop a built‐in function of the Bayesian lasso imputation model which is performed within the framework of multiple imputation using chained equations. A simulation study suggested that the Bayesian lasso imputation model worked well in a variety of longitudinal study settings, providing unbiased treatment effect estimates with well‐controlled type I error rates and coverage probabilities of the confidence interval; in contrast, ignorance of the informative auxiliary variables led to serious bias and inflation of type I error rate. Moreover, the Bayesian lasso imputation model offered higher statistical powers compared with conventional imputation methods. In our simulation study, the gains in statistical power were remarkable when the sample size was small relative to the number of auxiliary variables. An illustration through a real example also suggested that the Bayesian lasso imputation model could give smaller standard errors of the treatment effect estimate.
Models, Statistical, multiple imputation, Bayes Theorem, Bayesian Lasso, Applications of statistics to biology and medical sciences; meta analysis, longitudinal clinical study, missing data, Bias, Data Interpretation, Statistical, Humans, Computer Simulation, Longitudinal Studies
Models, Statistical, multiple imputation, Bayes Theorem, Bayesian Lasso, Applications of statistics to biology and medical sciences; meta analysis, longitudinal clinical study, missing data, Bias, Data Interpretation, Statistical, Humans, Computer Simulation, Longitudinal Studies
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 6 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
