Can’t train a pixel-based SAC for Walker2D environment

Hi, everyone.

Now I decided to try a new challenge: pixel-based SAC model for Walker2d environment. My problem is that even after a lot of training, it inmediatly falls. I have tried using optuna for hyperparameter search, but got nothing out of it.

I am using stable-baselines 3 library to train it. I tried training with the by-default reward and with custom reward, but it turned out almost the same outcome: no walking at all. I do not know what else to do.

If anyone had any suggestions/tips, it would be much appreciated!

submitted by /u/skroll18
[link] [comments]

Liked Liked