
arXiv: 1604.01942
We consider Markov decision processes (MDP) as generators of sequences of probability distributions over states. A probability distribution is p-synchronizing if the probability mass is at least p in a single state, or in a given set of states. We consider four temporal synchronizing modes: a sequence of probability distributions is always p-synchronizing, eventually p-synchronizing, weakly p-synchronizing, or strongly p-synchronizing if, respectively, all, some, infinitely many, or all but finitely many distributions in the sequence are p-synchronizing. For each synchronizing mode, an MDP can be (i) sure winning if there is a strategy that produces a 1-synchronizing sequence; (ii) almost-sure winning if there is a strategy that produces a sequence that is, for all epsilon > 0, a (1-epsilon)-synchronizing sequence; (iii) limit-sure winning if for all epsilon > 0, there is a strategy that produces a (1-epsilon)-synchronizing sequence. We provide fundamental results on the expressiveness, decidability, and complexity of synchronizing properties for MDPs. For each synchronizing mode, we consider the problem of deciding whether an MDP is sure, almost-sure, or limit-sure winning, and we establish matching upper and lower complexity bounds of the problems: for all winning modes, we show that the problems are PSPACE-complete for eventually and weakly synchronizing, and PTIME-complete for always and strongly synchronizing. We establish the memory requirement for winning strategies, and we show that all winning modes coincide for always synchronizing, and that the almost-sure and limit-sure winning modes coincide for weakly and strongly synchronizing.
arXiv admin note: substantial text overlap with arXiv:1402.2840, arXiv:1310.2935
FOS: Computer and information sciences, Computer Science - Logic in Computer Science, Probability in computer science (algorithm analysis, random structures, phase transitions, etc.), Formal Languages and Automata Theory (cs.FL), Analysis of algorithms and problem complexity, sequences of probability distributions, Modes of computation (nondeterministic, parallel, interactive, probabilistic, etc.), Computer Science - Formal Languages and Automata Theory, [INFO] Computer Science [cs], Markov decision processes, Logic in Computer Science (cs.LO), Models and methods for concurrent and distributed computing (process algebras, bisimulation, transition nets, etc.), Applications of game theory, complexity, synchronization
FOS: Computer and information sciences, Computer Science - Logic in Computer Science, Probability in computer science (algorithm analysis, random structures, phase transitions, etc.), Formal Languages and Automata Theory (cs.FL), Analysis of algorithms and problem complexity, sequences of probability distributions, Modes of computation (nondeterministic, parallel, interactive, probabilistic, etc.), Computer Science - Formal Languages and Automata Theory, [INFO] Computer Science [cs], Markov decision processes, Logic in Computer Science (cs.LO), Models and methods for concurrent and distributed computing (process algebras, bisimulation, transition nets, etc.), Applications of game theory, complexity, synchronization
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 11 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
