IEEJ Transactions on Electronics, Information and Systems
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<Robotics>
A Reinforcement Learning Using a Stochastic Gradient Method with Memory-Based Learning
Takafumi YamadaSatoshi Yamaguchi
Author information
JOURNAL FREE ACCESS

2008 Volume 128 Issue 7 Pages 1123-1130

Details
Abstract
In this paper, for agents working on POMDP, a learning algorithm combining the memory-less learning and the memory-based learning is proposed. At first stage of the propposed algorithm, memory-less learning is applied. As a memory-less learning algorithm, the stochastic gradient method is employed. While the first stage, a state-action set series that accmplish the task is stored in memory. In the second stage, the memory-based learning is applied. In this process, only the series that obtained the first stage is used, so that this method is able to reduce the number of required memory effectively.
The proposed algorithm are applied three kinds of simulation to be compared with memory-less learning algorithm. Through the computer simulations, it shown that the proposed algorithms works effectively in POMDP than ordinary memory-less learnings.
Content from these authors
© 2008 by the Institute of Electrical Engineers of Japan
Previous article Next article
feedback
Top