Distributionally Robust and Generalizable Inference

Publication: Article, Preprint, 01 Nov 2023, Switzerland

Embargo end date: 01 Jan 2022

Publisher: Institute of Mathematical Statistics

Journal: Statistical Science, volume 38 (ISSN: 0883-4237)

Funded by: EC | CausalStats

Authors: Rothenhäusler, Dominik; Bühlmann, Peter

doi: 10.1214/23-sts902, 10.48550/arxiv.2209.09352

arXiv: 2209.09352

handle: 20.500.11850/651998

Abstract

We discuss recently developed methods that quantify the stability and generalizability of statistical findings under distributional changes. In many practical problems, the data is not drawn i.i.d. from the target population. For example, unobserved sampling bias, batch effects, or unknown associations might inflate the variance compared to i.i.d. sampling. For reliable statistical inference, it is thus necessary to account for these types of variation. We discuss and review two methods that allow quantifying distribution stability based on a single dataset. The first method computes the sensitivity of a parameter under worst-case distributional perturbations to understand which types of shift pose a threat to external validity. The second method treats distributional shifts as random, which allows assessing average rather than worst-case robustness. Based on a stability analysis of multiple estimators on a single dataset, it integrates both sampling and distributional uncertainty into a single confidence interval.
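To make the idea of a worst-case distributional perturbation concrete, the following is a minimal sketch and not the authors' procedure. It assumes the perturbation set is a Kullback-Leibler ball of radius `delta` around the empirical distribution and that the parameter of interest is a mean; the abstract specifies neither choice. Under those assumptions the maximizing distribution is a standard exponential tilting of the sample, and the tilting strength can be found by bisection. The function names `tilted_weights`, `kl_from_uniform`, and `worst_case_mean` are hypothetical and introduced only for this illustration.

```python
import numpy as np


def tilted_weights(x, lam):
    """Weights proportional to exp(lam * x_i), for lam >= 0.

    Subtracting max(x) keeps every exponent non-positive, which avoids overflow.
    """
    w = np.exp(lam * (x - x.max()))
    return w / w.sum()


def kl_from_uniform(w):
    """KL(Q || P_n), where P_n puts mass 1/n on each point and Q has weights w."""
    n = len(w)
    nz = w > 0  # terms with zero weight contribute zero
    return float(np.sum(w[nz] * np.log(w[nz] * n)))


def worst_case_mean(x, delta, max_doublings=60, iters=100):
    """Largest reweighted mean over Q with KL(Q || P_n) <= delta.

    Illustrative assumption: a KL ball around the empirical distribution.
    """
    x = np.asarray(x, dtype=float)
    if delta <= 0 or np.ptp(x) == 0:
        return float(x.mean())

    # Find a tilting strength whose KL cost reaches the budget delta.
    hi = 1.0 / x.std()
    for _ in range(max_doublings):
        if kl_from_uniform(tilted_weights(x, hi)) >= delta:
            break
        hi *= 2.0
    else:
        # The budget exceeds what any tilting can spend: all mass on the maximum.
        return float(x.max())

    # The KL cost is increasing in lam, so bisection locates the boundary.
    lo = 0.0
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if kl_from_uniform(tilted_weights(x, mid)) < delta:
            lo = mid
        else:
            hi = mid
    return float(np.sum(tilted_weights(x, lo) * x))
```

Calling `worst_case_mean(x, delta)` for a grid of `delta` values traces how far the mean can move as the allowed shift grows; a parameter whose sign or size changes at small `delta` would be considered unstable in this simplified sense.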

Country

Switzerland

Related Organizations

ETH Zurich, Switzerland
Stanford University, United States

Keywords

Distributional robustness; external validity; generalizability; stability; uncertainty quantification; Statistics - Methodology (stat.ME); FOS: Computer and information sciences

Impact by BIP!

	selected citations: 1
	popularity: Average
	influence: Average
	impulse: Average
