Download PDF (external access)

7th European Symposium on Adaptive Agents and Multi-Agent Systems Maastricht, NETHERLANDS, 2007, Date: 2007/01/01, Location: NETHERLANDS, Maastricht

Publication date: 2008-01-01
Volume: 4865 Pages: 224 - 238
ISSN: 3540779477, 978-3-540-77947-6
Publisher: Springer-verlag berlin; HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY

Adaptive agents and multi-agent systems

Author:

Vrancx, Peter
Verbeeck, Katja ; Nowe, Ann

Keywords:

Science & Technology, Technology, Computer Science, Artificial Intelligence, Computer Science, Cybernetics, Computer Science

Abstract:

Learning Automata (LA) were recently shown to be valuable tools for designing Multi-Agent Reinforcement Learning algorithms. One of the principal contributions of LA theory is that a set of decentralized, independent learning automata is able to control a finite Markov Chain with unknown transition probabilities and rewards. This result was recently extended to Markov Games and analyzed with the use of limiting games. In this paper we continue this analysis but we assume here that our agents are fully ignorant about the other agents in the environment, i.e. they can only observe themselves; they do not know how many other agents are present in the environment, the actions these other agents took, the rewards they received for this, or the location they occupy in the state space. We prove that in Markov Games, where agents have this limited type of observability, a network of independent LA is still able to converge to an equilibrium point of the underlying limiting game, provided a common ergodic assumption and provided the agents do not interfere each other's transition probabilities.