Korrel: turn one agent eval into a verifiers or OpenEnv RL environment, with a fidelity proof against tau2-bench

submitted by /u/IssaLikesCheese
[link] [comments]

Liked Liked