Some Simplifications for the Expectation-Maximization (EM) Algorithm: The Linear Regression Model Case

Name: Some Simplifications for the Expectation-Maximization (EM) Algorithm: The Linear Regression Model Case
Creator: Griffith, Daniel A.
Keywords: Methodology (stat.ME), FOS: Computer and information sciences, Methodology

Griffith, Daniel A.

Found an issue? Give us feedback

arXiv.org e-Print Ar...arrow_drop_down

arXiv.org e-Print Archive

Preprint . 2025

Data sources: arXiv.org e-Print Archive

https://dx.doi.org/10.48550/ar...

Article . 2025

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

Some Simplifications for the Expectation-Maximization (EM) Algorithm: The Linear Regression Model Case

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2025Embargo end date: 01 Jan 2025Publisher:arXiv

Authors: Griffith, Daniel A.;

doi: 10.48550/arxiv.2509.19461

arXiv: 2509.19461

Some Simplifications for the Expectation-Maximization (EM) Algorithm: The Linear Regression Model Case

- Summary
- Subjects
- Metrics

Abstract

The EM algorithm is a generic tool that offers maximum likelihood solutions when datasets are incomplete with data values missing at random or completely at random. At least for its simplest form, the algorithm can be rewritten in terms of an ANCOVA regression specification. This formulation allows several analytical results to be derived that permit the EM algorithm solution to be expressed in terms of new observation predictions and their variances. Implementations can be made with a linear regression or a nonlinear regression model routine, allowing missing value imputations, even when they must satisfy constraints. Fourteen example datasets gleaned from the EM algorithm literature are reanalyzed. Imputation results have been verified with SAS PROC MI. Six theorems are proved that broadly contextualize imputation findings in terms of the theory, methodology, and practice of statistical science.

23 pages, 7 tables, 1 figure, former Interstat (now defunct) publication

Keywords

Methodology (stat.ME), FOS: Computer and information sciences, Methodology

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green