
arXiv: 2201.06274
Gridworlds have been long-utilised in AI research, particularly in reinforcement learning, as they provide simple yet scalable models for many real-world applications such as robot navigation, emergent behaviour, and operations research. We initiate a study of gridworlds using the mathematical framework of reconfigurable systems and state complexes due to Abrams, Ghrist & Peterson. State complexes represent all possible configurations of a system as a single geometric space, thus making them conducive to study using geometric, topological, or combinatorial methods. The main contribution of this work is a modification to the original Abrams, Ghrist & Peterson setup which we introduce to capture agent braiding and thereby more naturally represent the topology of gridworlds. With this modification, the state complexes may exhibit geometric defects (failure of Gromov's Link Condition). Serendipitously, we discover these failures occur exactly where undesirable or dangerous states appear in the gridworld. Our results therefore provide a novel method for seeking guaranteed safety limitations in discrete task environments with single or multiple agents, and offer useful safety information (in geometric and topological forms) for incorporation in or analysis of machine learning systems. More broadly, our work introduces tools from geometric group theory and combinatorics to the AI community and demonstrates a proof-of-concept for this geometric viewpoint of the task domain through the example of simple gridworld environments.
17 pages, 12 figures, 4 appendices; some parts rewritten and rearranged to improve exposition, no changes to mathematical content
FOS: Computer and information sciences, Computer Science - Artificial Intelligence, 57Z25 (Primary) 68R01, 51F99 (Secondary), G.2.0, Geometric Topology (math.GT), Metric Geometry (math.MG), I.2.0, Mathematics - Geometric Topology, Artificial Intelligence (cs.AI), Mathematics - Metric Geometry, FOS: Mathematics, Mathematics - Combinatorics, Computer Science - Multiagent Systems, Combinatorics (math.CO), I.2.0; G.2.0, Multiagent Systems (cs.MA)
FOS: Computer and information sciences, Computer Science - Artificial Intelligence, 57Z25 (Primary) 68R01, 51F99 (Secondary), G.2.0, Geometric Topology (math.GT), Metric Geometry (math.MG), I.2.0, Mathematics - Geometric Topology, Artificial Intelligence (cs.AI), Mathematics - Metric Geometry, FOS: Mathematics, Mathematics - Combinatorics, Computer Science - Multiagent Systems, Combinatorics (math.CO), I.2.0; G.2.0, Multiagent Systems (cs.MA)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
