ARES: Reinforcement Learning for Code Agents

Hey everyone! My company is releasing ARES (Agentic Research and Evaluation Suite) today: https://github.com/withmartian/ares

We’re hoping ARES can be a new Gym style environment for long horizon coding tasks, with a couple opinionated design decisions:

– async, so it can parallelize easily and to large workloads

– treats LLMRequests as environment observations and LLMResponses as actions, so we can treat the underlying LLM as the policy instead of a full agent orchestrator

– integrates with Harbor (harborframework.com) on the task format, so tons of tasks/coding environments are available

A key motivation for us was that a lot of RL with LLMs today feels like RL kind of by technicality. We believe having a solid Gym style interface (and lots of tasks with it) will let people scale up coding in a similar way as previous successful RL launches!

submitted by /u/theLastNenUser
[link] [comments]

Liked Liked