
arXiv: 1201.5786
SummaryThe ultimate goal of regression analysis is to obtain information about the conditional distribution of a response given a set of explanatory variables. This goal is, however, seldom achieved because most established regression models estimate only the conditional mean as a function of the explanatory variables and assume that higher moments are not affected by the regressors. The underlying reason for such a restriction is the assumption of additivity of signal and noise. We propose to relax this common assumption in the framework of transformation models. The novel class of semiparametric regression models proposed herein allows transformation functions to depend on explanatory variables. These transformation functions are estimated by regularized optimization of scoring rules for probabilistic forecasts, e.g. the continuous ranked probability score. The corresponding estimated conditional distribution functions are consistent. Conditional transformation models are potentially useful for describing possible heteroscedasticity, comparing spatially varying distributions, identifying extreme events, deriving prediction intervals and selecting variables beyond mean regression effects. An empirical investigation based on a heteroscedastic varying-coefficient simulation model demonstrates that semiparametric estimation of conditional distribution functions can be more beneficial than kernel-based non-parametric approaches or parametric generalized additive models for location, scale and shape.
Statistics and Probability, FOS: Computer and information sciences, 610 Medicine & health, 10060 Epidemiology, Biostatistics and Prevention Institute (EBPI), 62H12, 62G08, 62J02, 62J07, Methodology (stat.ME), 1804 Statistics, Probability and Uncertainty, 2613 Statistics and Probability, Statistics, Probability and Uncertainty, Statistics - Methodology
Statistics and Probability, FOS: Computer and information sciences, 610 Medicine & health, 10060 Epidemiology, Biostatistics and Prevention Institute (EBPI), 62H12, 62G08, 62J02, 62J07, Methodology (stat.ME), 1804 Statistics, Probability and Uncertainty, 2613 Statistics and Probability, Statistics, Probability and Uncertainty, Statistics - Methodology
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 77 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
