How to save the best-performing policy during training with CleanRL?

digitado ⋅ February 26, 2026

Hi guys, I’m new to the CleanRL library. I have run some of the training scripts with the `uv run python cleanrl/….py` command, but I’m not sure whether this saves the best policy during training (e.g. the policy that achieves the highest episode reward). I went through the CleanRL documentation and found no information about this. Does anyone know how I can save the best policy during training and load it after training?
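To make the question concrete, this is roughly the behavior I have in mind: keep a moving average of recent episodic returns and write a checkpoint whenever that average improves. The sketch below is only an illustration under my own assumptions. `BestCheckpointer`, `save_fn`, and `window` are hypothetical names, not part of CleanRL, and the helper would have to be called by hand wherever the training loop reads an episode's return.

```python
import math
from collections import deque
from typing import Callable


class BestCheckpointer:
    """Hypothetical helper (not a CleanRL API): saves when the mean recent return improves."""

    def __init__(self, path: str, save_fn: Callable[[str], None], window: int = 10):
        self.path = path          # where the best checkpoint is written
        self.save_fn = save_fn    # callable that writes the current weights to a path
        self.window = window      # number of recent episodes to average over
        self.recent = deque(maxlen=window)
        self.best = -math.inf     # best mean return seen so far

    def update(self, episodic_return: float) -> bool:
        """Record one finished episode; return True if a new best checkpoint was saved."""
        self.recent.append(float(episodic_return))
        if len(self.recent) < self.window:
            return False  # not enough episodes yet for a stable average
        mean_return = sum(self.recent) / len(self.recent)
        if mean_return > self.best:
            self.best = mean_return
            self.save_fn(self.path)  # overwrite the previous best checkpoint
            return True
        return False
```

In a PyTorch script where the network is an `nn.Module` named `agent` (an assumption about the script), `save_fn` could be `lambda p: torch.save(agent.state_dict(), p)`, and the weights could later be restored with `agent.load_state_dict(torch.load(path))`. Averaging over a window instead of using a single episode makes it less likely that one lucky episode gets saved as the "best" policy.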

submitted by /u/ZitaLovesCats