FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control
https://reddit.com/link/1sep2lt/video/tmacpy2vzptg1/player
We scaled off-policy RL for sim-to-real transfer. FlashSAC is the fastest and most performant RL algorithm across IsaacLab, MuJoCo Playground, Genesis, DeepMind Control Suite, and more, all with a single set of hyperparameters.
If you’re still using PPO, give FlashSAC a try.
submitted by /u/joonleesky