publication . Research . 1991

The detection of influential subsets in linear regression using an influence matrix

Peña, Daniel; Yohai, Víctor J.;
Open Access English
  • Published: 01 Mar 1991
  • Country: Spain
Abstract
This paper presents a new method to identify influential subsets in linear regression problems. The procedure uses the eigenstructure of an influence matrix which is defined as the matrix of uncentered covariance of the effect on the whole data set of deleting each observation, normalized to include the univariate Cook's statistics in the diagonal. It is shown that points in an influential subset will appear with large weight in at least one of the eigenvector linked to the largest eigenvalues in this influence matrix. The method is illustrated with several well-known examples in the literature, and in all of them it succeeds in identifying the relevant influent...
Subjects
free text keywords: Eigenvectors, Masking, Multivariate Influence, Outliers, Economía
Related Organizations

Chatterjee, S., and Hadi A.S. (1986), "Infiuential Observations, High Leverage Points, and Outliers in Lineal Regression," Statistical Science, 1, 3, 379-416. [OpenAIRE]

Cook, RD. (1979), "Infiuential Observations in Linear Regression," Journal 01 A merican Statistical Association, 74, 169-174.

Cook, RD. and Weisberg, S. (1982), Residuals and Influence in Regression. Chapman and Hall, New York. [OpenAIRE]

Daniel, C., and Wood, F.S. (1980), Fitting Equations To Data, John Wiley and Sonso Gray, J.B., and Ling, RF. (1984), "K-Clustering as a Detection Tool for Influential Subsets in Regression," Technometrics, 26, 305-330.

Hampel, F.R. (1974), "The Infiuence Curve and its Role in Robust Estimation," Journal 01 American Statistical Association, 69, 383-393. [OpenAIRE]

Hawkins, D.M., Bradu, D. and Kass, G.V. (1984), "Location of Several Outliers in Multiple Regression Data Using Elemental Sets," Technometrics, 26, 197-208.

Hocking, RR (1984), "Discussion of Gray and Ling paper," Technometrics, 26, 321­ Kianifard, F., and Swallow, "r. (1990), "A Monte Carlo Comparison offive Procedures for Identifying Outliers in Lineal Regression," Communication in Statistics (Theory and Methods), 19, 1913-1938.

Mararinghe, M.G. (1985), "A Multistage Procedure for Detecting Several Outliers in Linear Regression," Technometrics, 27, 395-399.

Rousseeuw, P.J. and Zomeren, B.C. (1990), "Unmasking Multivariate Outliers and Leverage Points," Journal 01 American Statistical Association, 85, 633-651.

Powered by OpenAIRE Open Research Graph
Any information missing or wrong?Report an Issue