Article information

  • Title: Adaptive integration of habits into depth-limited planning defines a habitual-goal–directed spectrum
  • Authors: Mehdi Keramati ; Peter Smittenaar ; Raymond J. Dolan
  • Journal: Proceedings of the National Academy of Sciences
  • Print ISSN: 0027-8424
  • Electronic ISSN: 1091-6490
  • Publication year: 2016
  • Volume: 113
  • Issue: 45
  • Pages: 12868-12873
  • DOI: 10.1073/pnas.1609094113
  • Language: English
  • Publisher: The National Academy of Sciences of the United States of America
  • Abstract: Behavioral and neural evidence reveal a prospective goal-directed decision process that relies on mental simulation of the environment, and a retrospective habitual process that caches returns previously garnered from available choices. Artificial systems combine the two by simulating the environment up to some depth and then exploiting habitual values as proxies for consequences that may arise in the further future. Using a three-step task, we provide evidence that human subjects use such a normative plan-until-habit strategy, implying a spectrum of approaches that interpolates between habitual and goal-directed responding. We found that increasing time pressure led to shallower goal-directed planning, suggesting that a speed-accuracy tradeoff controls the depth of planning with deeper search leading to more accurate evaluation, at the cost of slower decision-making. We conclude that subjects integrate habit-based cached values directly into goal-directed evaluations in a normative manner.
  • Keywords: planning ; habit ; reinforcement learning ; speed/accuracy tradeoff ; tree-based evaluation
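
The plan-until-habit idea summarized in the abstract (simulate the environment to a limited depth, then substitute cached habitual values for whatever lies beyond) can be illustrated with a minimal sketch. This is not the authors' model or their three-step task: the function name `plan_until_habit`, the deterministic toy `MODEL`, the cached table `HABIT_Q`, and the discount `gamma` are all assumptions made up for illustration.

```python
from typing import Dict, List, Tuple

State = str
Action = str
# Hypothetical deterministic world model: (state, action) -> (reward, next state).
Model = Dict[Tuple[State, Action], Tuple[float, State]]
# Hypothetical cached ("habitual") action values learned from past returns.
HabitQ = Dict[Tuple[State, Action], float]


def plan_until_habit(state: State, depth: int, model: Model, habit_q: HabitQ,
                     actions: List[Action], gamma: float = 1.0) -> Dict[Action, float]:
    """Estimate each action's value by simulating `depth` steps ahead,
    then falling back on cached habitual values at the search frontier."""
    values: Dict[Action, float] = {}
    for a in actions:
        # At the depth limit (or where the model has no transition),
        # use the cached value; unknown entries default to 0.
        if depth == 0 or (state, a) not in model:
            values[a] = habit_q.get((state, a), 0.0)
            continue
        reward, nxt = model[(state, a)]
        future = plan_until_habit(nxt, depth - 1, model, habit_q, actions, gamma)
        values[a] = reward + gamma * max(future.values())
    return values


# Toy two-level tree (illustrative only). The cached values are deliberately stale.
ACTIONS = ["L", "R"]
MODEL: Model = {
    ("s0", "L"): (0.0, "s1"), ("s0", "R"): (0.0, "s2"),
    ("s1", "L"): (1.0, "end"), ("s1", "R"): (0.0, "end"),
    ("s2", "L"): (0.0, "end"), ("s2", "R"): (5.0, "end"),
}
HABIT_Q: HabitQ = {
    ("s0", "L"): 2.0, ("s0", "R"): 1.0,
    ("s1", "L"): 3.0, ("s1", "R"): 0.0,
    ("s2", "L"): 0.0, ("s2", "R"): 0.5,
}

for d in range(3):
    print(d, plan_until_habit("s0", d, MODEL, HABIT_Q, ACTIONS))
```

In this toy setup, depth 0 is purely habitual and depth 2 is fully goal-directed; intermediate depths mix the two, which is one way to picture the spectrum the abstract describes. Here the stale cached values favor `L` at depths 0 and 1, while full planning at depth 2 favors `R`.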