Learning the Stackelberg Equilibrium in a Newsvendor Game

descriptionPublicationkeyboard_double_arrow_right Article , Conference object , Part of book or chapter of book 30 May 2023 Italy Publisher:IEEE Computer SocietyJournal:International Joint Conference on Autonomous Agents and Multiagent Systems (issn: 1558-2914,

Copyright policy )

Authors: Cesa-Bianchi Nicolò; Cesari Tommaso; Osogami Takayuki; Scarsini Marco; Wasserkrug Segev;

doi: 10.65109/hshm6853

handle: 11385/228598

Learning the Stackelberg Equilibrium in a Newsvendor Game

- Summary
- Subjects
- Metrics

Abstract

We study a repeated newsvendor game between a supplier and a retailer who want to maximize their respective profits without full knowledge of the problem parameters. After characterizing the uniqueness of the Stackelberg equilibrium of the stage game with complete information, we show that even with partial knowledge of the joint distribution of demand and production cost, natural learning dynamics guarantee convergence of the supplier and retailer's joint strategy profile to the Stackelberg equilibrium of the stage game. We also prove finite-time bounds on the supplier's regret and asymptotic bounds on the retailer's regret, where the specific rates depend on the type of knowledge preliminarily available to the players. Finally, we empirically confirm our theoretical findings on synthetic data.

Country

Italy

Related Organizations

Ibb University
Yemen
Guido Carli Free International University for Social Studies
Italy
University of Ottawa
Canada
University of Milan
Italy

Keywords

Regret minimization; Supply chain analysis; Newsvendor game; Online learning; Stackelberg equilibrium

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now