What standard RL frameworks do people use these days?

I was aware of TRL from Huggingface but it only supports vLLM as the rollout engine which is giving me problems (older CUDA but newer model).

I came across a few that support sglang – verl, openRLHF, NeMo-Aligner but wanted to see if there are any favorites.

submitted by /u/SnooCapers8442
[link] [comments]

Liked Liked