Forecaster’s Dilemma: Extreme Events and Forecast Evaluation

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Other literature type 01 Feb 2017Embargo end date: 01 Jan 2015 Germany Publisher:Institute of Mathematical StatisticsJournal:Statistical Science, volume 32 (issn: 0883-4237,

Copyright policy )

Authors: Lerch, Sebastian; Thorarinsdottir, Thordis L.; Ravazzolo, Francesco; Gneiting, Tilmann;

doi: 10.1214/16-sts588 , 10.48550/arxiv.1512.09244

arXiv: 1512.09244

Forecaster’s Dilemma: Extreme Events and Forecast Evaluation

- Summary
- Subjects
- Metrics

Abstract

In public discussions of the quality of forecasts, attention typically focuses on the predictive performance in cases of extreme events. However, the restriction of conventional forecast evaluation methods to subsets of extreme observations has unexpected and undesired effects, and is bound to discredit skillful forecasts when the signal-to-noise ratio in the data generating process is low. Conditioning on outcomes is incompatible with the theoretical assumptions of established forecast evaluation methods, thereby confronting forecasters with what we refer to as the forecaster's dilemma. For probabilistic forecasts, proper weighted scoring rules have been proposed as decision theoretically justifiable alternatives for forecast evaluation with an emphasis on extreme events. Using theoretical arguments, simulation experiments, and a real data study on probabilistic forecasts of U.S. inflation and gross domestic product growth, we illustrate and discuss the forecaster's dilemma along with potential remedies.

Country

Germany

Related Organizations

HITS GGMBH
Germany
Karlsruhe Institute of Technology
Germany
Heidelberg Institute for Theoretical Studies
Germany

Keywords

ddc:510, FOS: Computer and information sciences, proper weighted scoring rule, Diebold-Mariano test, predictive performance, likelihood ratio test, Neyman-Pearson lemma, probabilistic forecast, 510, Inference from stochastic processes and prediction, rare and extreme events, Methodology (stat.ME), Neyman–Pearson lemma, hindsight bias, Diebold–Mariano test, Applications of statistics to economics, Mathematics, info:eu-repo/classification/ddc/510, Statistics - Methodology

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	98
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%