A 13.8GB Model Went From 2/10 to 10/10 — and They Never Touched the Model

A 13.8GB local model that fails a SWE-bench task on its own solved the same task 10 times out of 10 once it was wrapped in a state machine…

Liked Liked