Compare harnesses not models
|
Harnesses and agentic systems matter as much as AI models, and are key to getting consistent results in software development. We audited Blitzy´s score on SWE-Bench Pro and wrote down our learnings. submitted by /u/quesmahq |
Like
0
Liked
Liked