首页    期刊浏览 2025年02月21日 星期五
登录注册

文章基本信息

  • 标题:Exemplar-Based Policy with Selectable Strategies and its Optimization Using GA
  • 本地全文:下载
  • 作者:Kokolo Ikeda ; Shigenobu Kobayashi ; Hajime Kita
  • 期刊名称:人工知能学会論文誌
  • 印刷版ISSN:1346-0714
  • 电子版ISSN:1346-8030
  • 出版年度:2010
  • 卷号:25
  • 期号:2
  • 页码:351-362
  • DOI:10.1527/tjsai.25.351
  • 出版社:The Japanese Society for Artificial Intelligence
  • 摘要:As an approach for dynamic control problems and decision making problems, usually formulated as Markov Decision Processes (MDPs), we focus direct policy search (DPS), where a policy is represented by a model with parameters, and the parameters are optimized so as to maximize the evaluation function by applying the parameterized policy to the problem. In this paper, a novel framework for DPS, an exemplar-based policy optimization using genetic algorithm (EBP-GA) is presented and analyzed. In this approach, the policy is composed of a set of virtual exemplars and a case-based action selector, and the set of exemplars are selected and evolved by a genetic algorithm. Here, an exemplar is a real or virtual, free-styled and suggestive information such as ``take the action A at the state S'' or ``the state S1 is better to attain than S2''. One advantage of EBP-GA is the generalization and localization ability for policy expression, based on case-based reasoning methods. Another advantage is that both the introduction of prior knowledge and the extraction of knowledge after optimization are relatively straightforward. These advantages are confirmed through the proposal of two new policy expressions, experiments on two different problems and their analysis.
  • 关键词:direct policy search ; genetic algorithm ; case-based reasoning ; Markov decision process ; exemplar
国家哲学社会科学文献中心版权所有