A 13.8GB Model Went From 2/10 to 10/10 — and They Never Touched the Model
A 13.8GB local model that fails a SWE-bench task on its own solved the same task 10 times out of 10 once it was wrapped in a state machine…
Like
0
Liked
Liked
A 13.8GB local model that fails a SWE-bench task on its own solved the same task 10 times out of 10 once it was wrapped in a state machine…