A (Long) Peek into Reinforcement Learning

[Updated on 2020-09-03: Updated the algorithm of SARSA and Q-learning so that the difference is more pronounced.]

[Updated on 2021-09-19: Thanks to 爱吃猫的鱼, we have this post in Chinese.]
