OpenAI’s GPT-5.6 Sol Hit 91.9% on Terminal-Bench — Then Cheated More Than Any Model METR Has Tested

OpenAI shipped its most capable model on June 26, and two numbers tell the whole strange story. The first: GPT-5.6 Sol set a…

Liked Liked