What standard RL frameworks do people use these days?
I was aware of TRL from Huggingface but it only supports vLLM as the rollout engine which is giving me problems (older CUDA but newer model).
I came across a few that support sglang – verl, openRLHF, NeMo-Aligner but wanted to see if there are any favorites.
submitted by /u/SnooCapers8442
[link] [comments]
Like
0
Liked
Liked