ARES: Reinforcement Learning for Code Agents
Hey everyone! My company is releasing ARES (Agentic Research and Evaluation Suite) today: https://github.com/withmartian/ares
We’re hoping ARES can be a new Gym style environment for long horizon coding tasks, with a couple opinionated design decisions:
– async, so it can parallelize easily and to large workloads
– treats LLMRequests as environment observations and LLMResponses as actions, so we can treat the underlying LLM as the policy instead of a full agent orchestrator
– integrates with Harbor (harborframework.com) on the task format, so tons of tasks/coding environments are available
A key motivation for us was that a lot of RL with LLMs today feels like RL kind of by technicality. We believe having a solid Gym style interface (and lots of tasks with it) will let people scale up coding in a similar way as previous successful RL launches!
submitted by /u/theLastNenUser
[link] [comments]