首页    期刊浏览 2024年11月29日 星期五
登录注册

文章基本信息

  • 标题:Reinforcement learning by GA using Importance Sampling
  • 本地全文:下载
  • 作者:Chikao Tsuchiya ; Hajime Kimura ; Jun Sakuma
  • 期刊名称:人工知能学会論文誌
  • 印刷版ISSN:1346-0714
  • 电子版ISSN:1346-8030
  • 出版年度:2005
  • 卷号:20
  • 期号:1
  • 页码:1-10
  • DOI:10.1527/tjsai.20.1
  • 出版社:The Japanese Society for Artificial Intelligence
  • 摘要:Reinforcement Learning (RL) handles policy search problems: searching a mapping from state space to action space. However RL is based on gradient methods and as such, cannot deal with problems with multimodal landscape. In contrast, though Genetic Algorithm (GA) is promising to deal with them, it seems to be unsuitable for policy search problems from the viewpoint of the cost of evaluation. Minimal Generation Gap (MGG), used as a generation-alternation model in GA, generates many offspring from two or more parents selected from a population. Therefore, evaluating policies of generated offspring requires much trial and error (i.e. interaction between an agent and an environment). In this paper, we incorporate importance sampling into the framework of MGG in order to reduce the cost of evaluation on policy search. The proposed techniques are applied to Markov Decision Process (MDP) with multimodal landscape. The experimental results show that these techniques can reduce the number of interaction between an agent and an environment, and also mean that MGG and importance sampling are good for each other.
  • 关键词:GA ; reinforcement learning ; direct policy search ; importance sampling
国家哲学社会科学文献中心版权所有