首页    期刊浏览 2024年12月02日 星期一
登录注册

文章基本信息

  • 标题:Scalable photonic reinforcement learning by time-division multiplexing of laser chaos
  • 本地全文:下载
  • 作者:Makoto Naruse ; Takatomo Mihana ; Hirokazu Hori
  • 期刊名称:Scientific Reports
  • 电子版ISSN:2045-2322
  • 出版年度:2018
  • 卷号:8
  • 期号:1
  • 页码:10890
  • DOI:10.1038/s41598-018-29117-y
  • 语种:English
  • 出版社:Springer Nature
  • 摘要:Reinforcement learning involves decision-making in dynamic and uncertain environments and constitutes a crucial element of artificial intelligence. In our previous work, we experimentally demonstrated that the ultrafast chaotic oscillatory dynamics of lasers can be used to efficiently solve the two-armed bandit problem, which requires decision-making concerning a class of difficult trade-offs called the exploration-exploitation dilemma. However, only two selections were employed in that research; hence, the scalability of the laser-chaos-based reinforcement learning should be clarified. In this study, we demonstrated a scalable, pipelined principle of resolving the multi-armed bandit problem by introducing time-division multiplexing of chaotically oscillated ultrafast time series. The experimental demonstrations in which bandit problems with up to 64 arms were successfully solved are presented where laser chaos time series significantly outperforms quasiperiodic signals, computer-generated pseudorandom numbers, and coloured noise. Detailed analyses are also provided that include performance comparisons among laser chaos signals generated in different physical conditions, which coincide with the diffusivity inherent in the time series. This study paves the way for ultrafast reinforcement learning by taking advantage of the ultrahigh bandwidths of light wave and practical enabling technologies.
国家哲学社会科学文献中心版权所有