Downloads provided by UsageCounts
pmid: 29339817
pmc: PMC5770455
AbstractSince Alan Turing envisioned artificial intelligence, technical progress has often been measured by the ability to defeat humans in zero-sum encounters (e.g., Chess, Poker, or Go). Less attention has been given to scenarios in which human–machine cooperation is beneficial but non-trivial, such as scenarios in which human and machine preferences are neither fully aligned nor fully in conflict. Cooperation does not require sheer computational power, but instead is facilitated by intuition, cultural norms, emotions, signals, and pre-evolved dispositions. Here, we develop an algorithm that combines a state-of-the-art reinforcement-learning algorithm with mechanisms for signaling. We show that this algorithm can cooperate with people and other algorithms at levels that rival human cooperation in a variety of two-player repeated stochastic games. These results indicate that general human–machine cooperation is achievable using a non-trivial, but ultimately simple, set of algorithmic mechanisms.
FOS: Computer and information sciences, Stochastic Processes, Computer Science - Artificial Intelligence, Science, Communication, [INFO.INFO-DS]Computer Science [cs]/Data Structures and Algorithms [cs.DS], Q, Article, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], 004, Artificial Intelligence (cs.AI), [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Artificial Intelligence, Humans, [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC], Cooperative Behavior, B- ECONOMIE ET FINANCE, Algorithms
FOS: Computer and information sciences, Stochastic Processes, Computer Science - Artificial Intelligence, Science, Communication, [INFO.INFO-DS]Computer Science [cs]/Data Structures and Algorithms [cs.DS], Q, Article, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], 004, Artificial Intelligence (cs.AI), [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Artificial Intelligence, Humans, [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC], Cooperative Behavior, B- ECONOMIE ET FINANCE, Algorithms
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 168 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
| views | 12 | |
| downloads | 13 |

Views provided by UsageCounts
Downloads provided by UsageCounts