Multiple Imputation of Binary Multilevel Missing not at Random Data

descriptionPublicationkeyboard_double_arrow_right Article 24 Feb 2020 Germany English Publisher:Oxford University Press (OUP)Journal:Journal of the Royal Statistical Society Series C: Applied Statistics, volume 69, pages 547-564 (issn: 0035-9254, eissn: 1467-9876,

Copyright policy )

Authors: Hammon, Angelina; Zinn, Sabine;

doi: 10.1111/rssc.12401

handle: 10419/222432

Multiple Imputation of Binary Multilevel Missing not at Random Data

- Summary
- Subjects
- Metrics

Abstract

SummaryWe introduce a selection model-based multilevel imputation approach to be used within the fully conditional specification framework for multiple imputation. Concretely, we apply a censored bivariate probit model to describe binary variables assumed to be missing not at random. The first equation of the model defines the regression model for the missing data mechanism. The second equation specifies the regression model of the variable to be imputed. The non-random selection of the binary data is mapped by correlations between the error terms of the two regression models. Hierarchical data structures are modelled by random intercepts in both equations. To fit the novel imputation model we use maximum likelihood and adaptive Gauss–Hermite quadrature. A comprehensive simulation study shows the overall performance of the approach. We test its usefulness for empirical research by applying it to a common problem in social scientific research: the emergence of educational aspirations. Our software is designed to be used in the R package mice.

Country

Germany

Related Organizations

University of Bamberg
Germany
German Institute for Economic Research
Germany
Leibniz Association
Germany

Keywords

multiple imputation, ddc:330, Missingness not at random, fully conditional specification, missingness not at random, selection model, Applications of statistics, 300, Multilevel data, multilevel data, Fully conditional speciﬁcation, Selection model, Multiple imputation, Fully conditional specification

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	8
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

8

Top 10%

Average

Top 10%

Green

hybrid

Fields of Science

Fields of Science