I just updated my RL notes!

https://github.com/roboticcam/machine-learning-notes

They cover both foundational material, such as the policy gradient theorem, and recent methods, such as GRPO.

submitted by /u/Delicious_Screen_789
