Even the strongest frontier model they used - GPT 5.2 - I would consider barely ...

nozzlegear · 2026-05-24T20:59:09 1779656349

Oldheads remember when GPT 5.2 was at the forefront of agentic programming. December 2025 feels like eons ago, but alack it was an entire half year!

ipaddr · 2026-05-25T02:33:35 1779676415

If I'm not using got 5.5 high reasoning I'm wasting time.

nozzlegear · 2026-05-25T02:59:43 1779677983

Well, maybe so, but how did you feel about 5.2 when it was OpenAI's frontier model? That's what I'm getting at – it was the equivalent of your gpt 5.5 high reasoning just six months ago.

ipaddr · 2026-05-25T16:47:50 1779727670

It was a joke. I think you need to mix up models.

nozzlegear · 2026-05-25T21:50:23 1779745823

Gotcha. Hard to parse tone and intent through text on the internet.

viking123 · 2026-05-25T09:18:38 1779700718

They all feel the same to me now, opus, 5.5, whatever

sigbottle · 2026-05-24T17:04:40 1779642280

Wait isn't gpt 5.2 good? Or is it not thinking / not codex? 5.2 was what sparked the late 2025 openai agentic programming revolution.

mkozlows · 2026-05-25T03:50:49 1779681049

5.2 still had a Codex variant, which this doesn't describe using. It also notably is not using the Codex harness -- it does everything with open source harnesses (which obviously are worse). And while it uses two harnesses with its cheap models, it only uses the worse-performing one of those with GPT 5.2 for cost reasons. (They also don't specify effort/thinking level used for GPT 5.2, but given that it performs worse in their baseline testing than obviously non-SOTA models, I'm guessing it wasn't set to anything high.)