The Operator Approach to Entropy Games

Name: The Operator Approach to Entropy Games
Keywords: risk sensitive control, FOS: Computer and information sciences, 330, Policy iteration, Miscellaneous applications of operator theory, Perron eigenvalues, 91A15, 47H05, 93E20, Computer Science - Computer Science and Game Theory, FOS: Mathematics, stochastic games

Akian, Marianne; Gaubert, Stéphane; Grand-Clément, Julien; Guillaud, Jérémie

Found an issue? Give us feedback

downloadFull-Text

DROPS - Dagstuhl Res...arrow_drop_down

DROPS - Dagstuhl Research Online Publication Server (Schloss Dagstuhl - Leibniz Center for Informatics )

Article . 2017

License: CC BY

Full-Text: https://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.STACS.2017.6

Data sources: Bielefeld Academic Search Engine (BASE)

Theory of Computing Systems

Article

License: CC BY

Data sources: UnpayWall

arXiv.org e-Print Archive

Preprint . 2019

Data sources: arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Conference object . 2017

License: CC BY

Data sources: Dagstuhl Research Online Publication Server

Theory of Computing Systems

Article . 2019 . Peer-reviewed

License: Springer TDM

Data sources: Crossref

zbMATH Open

Article . 2019

Data sources: zbMATH Open

INRIA2

Conference object . 2017

Data sources: INRIA2

INRIA2

Article . 2019

Data sources: INRIA2

HAL Sorbonne Université

Article . 2019

Data sources: HAL Sorbonne Université

INRIA a CCSD electronic archive server

Conference object . 2017

Data sources: INRIA a CCSD electronic archive server

INRIA a CCSD electronic archive server

Article . 2019

Data sources: INRIA a CCSD electronic archive server

https://dx.doi.org/10.48550/ar...

Article . 2019

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

https://dx.doi.org/10.1007/s00...

Article

Data sources: Microsoft Academic Graph

École Polytechnique, Université Paris-Saclay: HAL

Article . 2019

Data sources: Bielefeld Academic Search Engine (BASE)

MINES ParisTech: Open Archive (HAL)

Article . 2019

Data sources: Bielefeld Academic Search Engine (BASE)

The Operator Approach to Entropy Games

The operator approach to entropy games

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 30 May 2019Embargo end date: 01 Jan 2019 English Publisher:Springer Science and Business Media LLCJournal:Theory of Computing Systems, volume 63, pages 1,089-1,130 (issn: 1432-4350, eissn: 1433-0490,

Copyright policy )

Authors: Akian, Marianne; Gaubert, Stéphane; Grand-Clément, Julien; Guillaud, Jérémie;

doi: 10.1007/s00224-019-09925-z , 10.48550/arxiv.1904.05151

arXiv: 1904.05151

The Operator Approach to Entropy Games

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Entropy games and matrix multiplication games have been recently introduced by Asarin et al. They model the situation in which one player (Despot) wishes to minimize the growth rate of a matrix product, whereas the other player (Tribune) wishes to maximize it. We develop an operator approach to entropy games. This allows us to show that entropy games can be cast as stochastic mean payoff games in which some action spaces are simplices and payments are given by a relative entropy (Kullback-Leibler divergence). In this way, we show that entropy games with a fixed number of states belonging to Despot can be solved in polynomial time. This approach also allows us to solve these games by a policy iteration algorithm, which we compare with the spectral simplex algorithm developed by Protasov.

29 pages. This is an extended version of the article with the same title and authors published in the Proceedings of the 34th Symposium on Theoretical Aspects of Computer Science (STACS 2017), Leibniz International Proceedings in Informatics (LIPIcs), volume 66, pages 6:1--6:14, 2017

Related Organizations

French National Centre for Scientific Research
France
University of Paris
France
French Institute for Research in Computer Science and Automation
France
PSL Research University
France
Columbia University
United States

View all View all

Keywords

risk sensitive control, FOS: Computer and information sciences, 330, Policy iteration, Miscellaneous applications of operator theory, Perron eigenvalues, 91A15, 47H05, 93E20, Computer Science - Computer Science and Game Theory, FOS: Mathematics, stochastic games, Mathematics - Optimization and Control, F.2.1 Numerical Algorithms and Problems, [MATH.MATH-OC] Mathematics [math]/Optimization and Control [math.OC], Shapley operators, 2-person games, 004, policy iteration, Stochastic games, stochastic differential games, 1991 Mathematics Subject Classification. G.2.1 Combinatorial algorithms, Optimization and Control (math.OC), Probability distributions: general theory, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], Stochastic games, Risk sensitive control, Computer Science and Game Theory (cs.GT), ddc: ddc:004

1 Research products, page 1 of 1

Log-Sum-Exp Neural Networks and Posynomial Models for Convex and Log-Log-Convex Data
2020IsVersionOf

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	9
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

9

Top 10%

Average

Green

hybrid

Related to Research communities

INRIA

The Operator Approach to Entropy Games

The Operator Approach to Entropy Games

1 Research products, page 1 of 1

Log-Sum-Exp Neural Networks and Posynomial Models for Convex and Log-Log-Convex Data