The effect of bootstrapping in multi-automata reinforcement learning

Peeters, Maarten; Verbeeck, Katja; Nowe, Ann

doi:10.1109/ADPRL.2007.368172

2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning

Authors:

Peeters, Maarten; Verbeeck, Katja; Nowe, Ann

Keywords:

Science & Technology; Technology; Computer Science, Artificial Intelligence; Computer Science

Abstract:

Learning automata have been shown to be an excellent tool for creating learning multi-agent systems. Most algorithms used in current automata research expect the environment to end in an explicit end-stage. In this end-stage the rewards are given to the learning automata (i.e., Monte Carlo updating). This is, however, infeasible in sequential decision problems with an infinite horizon, where no such end-stage exists. In this paper we propose a new algorithm based on one-step returns that uses bootstrapping to find good equilibrium paths in multi-stage games.
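
To make the contrast in the abstract concrete, the following is a minimal illustrative sketch and not the authors' algorithm, which this record does not spell out. It assumes a standard linear reward-inaction automaton, a single agent on a hypothetical two-stage tree, rewards scaled so every reinforcement stays in [0, 1], and made-up reward values. The names `LinearRewardInaction`, `monte_carlo_target`, `one_step_target`, and `run_demo` are invented for this illustration.

```python
import random


class LinearRewardInaction:
    """Learning automaton with the standard linear reward-inaction update."""

    def __init__(self, n_actions, learning_rate=0.05):
        self.probs = [1.0 / n_actions] * n_actions
        self.learning_rate = learning_rate

    def choose(self, rng):
        return rng.choices(range(len(self.probs)), weights=self.probs)[0]

    def update(self, action, reinforcement):
        """Shift probability toward `action`; `reinforcement` must lie in [0, 1]."""
        step = self.learning_rate * reinforcement
        for a in range(len(self.probs)):
            if a == action:
                self.probs[a] += step * (1.0 - self.probs[a])
            else:
                self.probs[a] -= step * self.probs[a]


def monte_carlo_target(path_rewards):
    """Reinforcement from the whole path (assumed here to be the mean reward).

    It can only be computed once an explicit end-stage has been reached.
    """
    return sum(path_rewards) / len(path_rewards)


def one_step_target(reward, next_value, gamma=0.9):
    """Bootstrapped reinforcement: immediate reward plus a discounted estimate
    of the next stage's value, clamped to [0, 1] for the automaton update."""
    return min(1.0, max(0.0, reward + gamma * next_value))


def run_demo(episodes=2000, seed=0, gamma=0.9, value_rate=0.1):
    rng = random.Random(seed)
    # Hypothetical tree: the root action (0 or 1) leads to child state 1 or 2.
    rewards = {0: [0.0, 0.0], 1: [0.2, 0.4], 2: [0.1, 0.9]}
    automata = {s: LinearRewardInaction(2) for s in rewards}
    values = {s: 0.0 for s in rewards}  # running value estimate per state
    for _ in range(episodes):
        state = 0
        while state is not None:
            action = automata[state].choose(rng)
            next_state = action + 1 if state == 0 else None
            next_value = values[next_state] if next_state is not None else 0.0
            target = one_step_target(rewards[state][action], next_value, gamma)
            # The automaton is updated immediately, without waiting for an end-stage.
            automata[state].update(action, target)
            values[state] += value_rate * (target - values[state])
            state = next_state
    return automata, values
```

Calling `run_demo()` returns the automata and the value estimates for inspection. The point of the sketch is only that each automaton receives a reinforcement after every step, built from the current estimate of the successor state, whereas `monte_carlo_target` needs the complete path.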
