How do I improve this (quadruped RL learning)
|
I’m new to RL and new to mujoco, so I have no idea what variables i should tune. Here are the variables ive rewarded/penalized: I’ve rewarded the following:
and I’ve placed penalties on:
submitted by /u/aeauo |
Like
0
Liked
Liked