IEEJ Transactions on Electronics, Information and Systems
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<Softcomputing, Learning>
A Reinforcement Learning Model with Function of Generating Macro-Actions in Grid-World Maze Problems and a Study on its Learning Property
Hiroshi Onda, Seiichi Ozawa
JOURNAL FREE ACCESS

2009 Volume 129 Issue 4 Pages 735-743

Abstract
A macro-action is a typical series of useful actions that brings high expected rewards to an agent. Murata et al. have proposed an Actor-Critic model that generates macro-actions automatically from information on state values and the visiting frequency of states. However, their model does not assume that the generated macro-actions are reused for learning different tasks. In this paper, we extend the model of Murata et al. so that generated macro-actions help an agent learn an optimal policy quickly in multi-task Grid-World (MTGW) maze problems. The proposed model is applied to two MTGW problems, each of which consists of six different maze tasks. From the experimental results, it is concluded that the proposed model can speed up learning if macro-actions are generated in the so-called correlated regions.
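
To make the notion of a macro-action concrete, the following minimal Python sketch treats a macro-action as a fixed sequence of primitive moves executed in a small grid-world maze. It is only an illustration of the concept, not the model of Murata et al. or the proposed extension: the grid layout, the names `MOVES`, `step`, and `run_macro`, and the hand-written macro are all assumptions introduced here, and the generation of macro-actions from state values and visiting frequencies is not shown.

```python
from typing import List, Tuple

State = Tuple[int, int]  # (row, column) position of the agent

# Primitive actions as (row delta, column delta). Hypothetical names.
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def step(grid: List[str], state: State, action: str) -> State:
    """Apply one primitive action; the agent stays put if a wall or border blocks it."""
    dr, dc = MOVES[action]
    r, c = state[0] + dr, state[1] + dc
    if 0 <= r < len(grid) and 0 <= c < len(grid[0]) and grid[r][c] != "#":
        return (r, c)
    return state


def run_macro(grid: List[str], state: State, macro: List[str]) -> State:
    """Execute a macro-action, i.e. a series of primitive actions, as one unit."""
    for action in macro:
        state = step(grid, state, action)
    return state


# Assumed toy maze: '#' marks a wall, '.' marks a free cell.
GRID = [
    "....",
    ".##.",
    "....",
]

# A hand-written macro-action (an assumption, not a learned one).
MACRO = ["right", "right", "right", "down", "down"]
end_state = run_macro(GRID, (0, 0), MACRO)
```

In a multi-task setting, such a sequence would be offered to the agent as a single selectable action alongside the primitive moves, which is where reuse across maze tasks could shorten learning.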
© 2009 by the Institute of Electrical Engineers of Japan