文章基本信息

标题：Online Bahavior Aquisition of an Agent based on Coaching as Learning Assistance
本地全文：下载
作者：Masakazu HIROKAWA ; Kenji SUZUKI
期刊名称：人工知能学会論文誌
印刷版ISSN：1346-0714
电子版ISSN：1346-8030
出版年度：2010
卷号：25
期号：6
页码：694-702
DOI：10.1527/tjsai.25.694
出版社：The Japanese Society for Artificial Intelligence
摘要：This paper describes a novel methodology, namely ``Coaching'', which allows humans to give a subjective evaluation to an agent in an iterative manner. This is an interactive learning method to improve the reinforcement learning by modifying a reward function dynamically according to given evaluations by a trainer and the learning situation of the agent. We demonstrate that the agent can learn different reward functions by given instructions such as ``good or bad'' by human's observation, and can also obtain a set of behavior based on the learnt reward functions through several experiments.
关键词：HAI ; reinforcement learning ; coaching