I just updated my RL notes!
https://github.com/roboticcam/machine-learning-notes
It included both the foundational knowledge such as policy gradient theorem as well as the latest such as GRPO.
submitted by /u/Delicious_Screen_789
[link] [comments]
Like
0
Liked
Liked