サプトロ イスティアサ
Stanford - XCS234: Reinforcement Learning
Foundations of RL
Dynamic programming, Monte Carlo methods & temporal difference learning, RL with value function approximation, Policy gradient methods
Batch/offline reinforcement learning
Monte Carlo tree search, Open challenges and hot topics in RL