
ObjectiveTo identify the most appropriate imputation method for missing data in the HCUP State Inpatient Databases (SID) and assess the impact of different missing data methods on racial disparities research.Data Sources/Study SettingHCUP SID.Study DesignA novel simulation study compared four imputation methods (random draw, hot deck, joint multiple imputation [MI], conditional MI) for missing values for multiple variables, including race, gender, admission source, median household income, and total charges. The simulation was built on real data from the SID to retain their hierarchical data structures and missing data patterns. Additional predictive information from the U.S. Census and American Hospital Association (AHA) database was incorporated into the imputation.Principal FindingsConditional MI prediction was equivalent or superior to the best performing alternatives for all missing data structures and substantially outperformed each of the alternatives in various scenarios.ConclusionsConditional MI substantially improved statistical inferences for racial health disparities research with the SID.
Models, Statistical, Racial Groups, Hospitals, Patient Admission, Sex Factors, Research Design, Data Interpretation, Statistical, Humans, Computer Simulation, Healthcare Disparities
Models, Statistical, Racial Groups, Hospitals, Patient Admission, Sex Factors, Research Design, Data Interpretation, Statistical, Humans, Computer Simulation, Healthcare Disparities
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 22 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
