Using linear programming duality for solving finite horizon Dec-POMDPs

Name: Using linear programming duality for solving finite horizon Dec-POMDPs
Keywords: Dec-POMDPs, [INFO.INFO-GT] Computer Science [cs]/Computer Science and Game Theory [cs.GT], [INFO.INFO-MA] Computer Science [cs]/Multiagent Systems [cs.MA], decentralized problems, [INFO.INFO-RO] Computer Science [cs]/Operations Research [math.OC]

Aras, Raghav; Dutech, Alain; Charpillet, François

Found an issue? Give us feedback

Halarrow_drop_down

Hal

External research report . 2008

Data sources: Hal

INRIA2

External research report . 2008

Data sources: INRIA2

INRIA a CCSD electronic archive server

External research report . 2008

Data sources: INRIA a CCSD electronic archive server

Using linear programming duality for solving finite horizon Dec-POMDPs

descriptionPublicationkeyboard_double_arrow_right External research report 01 Jan 2008 English

Authors: Aras, Raghav; Dutech, Alain; Charpillet, François;

Using linear programming duality for solving finite horizon Dec-POMDPs

- Summary
- Subjects
- Metrics

Abstract

This paper studies the problem of finding an optimal finite horizon joint policy for a decentralized partially observable Markov decision process (Dec-POMDP). We present a new algorithm for finding an optimal joint policy. The algorithm is based on the fact that the necessary condition for a joint policy to be optimal is that it be locally optimal (that is, a Nash equilibrium). Through the application of linear programming duality, the necessary condition can be transformed to a nonlinear program which can then further be transformed to a 0-1 mixed integer linear program (MILP) whose optimal solution is an optimal joint policy (in the sequence form). The proposed algorithm thus consists of solving this 0-1 MILP. Computational experience of the 0-1 MILP on two and three agent DEC-POMDPs gives mixed results. On some problems it is faster than existing algorithms, on others it is slower.

Related Organizations

Keywords

Dec-POMDPs, [INFO.INFO-GT] Computer Science [cs]/Computer Science and Game Theory [cs.GT], [INFO.INFO-MA] Computer Science [cs]/Multiagent Systems [cs.MA], decentralized problems, [INFO.INFO-RO] Computer Science [cs]/Operations Research [math.OC]

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Related to Research communities

INRIA