Ask HN: What LLM models are you using and why?

freakynit · 2026-05-17T17:06:00 1779037560

1. gpt-5.5-medium for most demanding coding tasks.

2. gpt-5.3-codex-medium for genrally most of the other coding tasks.

3. deepseek-v4-flash for heavy agentic research/loops (non-coding related).

4. mimo-v2.5-pro for crunching/summarizing large texts.

5. gemini-3.1-flash-lite for image understanding.

6. opus-4.7 very occasionally when gpt-5.5 fails, or vice-versa, and sonnet-4.6 when codex-5.3 fails.

7. deepseek-v4-pro when I need to do a long agentic session, and want higher quality, for cheap (non-coding).

8. perplexity/pplx-embed-v1-0.6b for embeddings, via openrouter.

kifler · 2026-05-20T15:16:26 1779290186

Just curious what constitutes a 'demanding' coding task for you.

zambelli · 2026-05-17T18:40:00 1779043200

I use Opus 4.7 for personal stuff (basically for everything), but have been considering gpt-5.5 given all I hear about it.

At work I use 4.6 because we don't have 4.7 yet...zzz...

I also do a LOT of personal/portfolio work with self-hosted models.

Ministral-3-14B-Reasoning for validating concepts, MVPs, etc and some prod systems (punches above its weight class). Qwen3.6-35B-A3B for self-hosted coding (custom harness). GPT-OSS-120B for self-hosted coding or more reasoning-intensive agentic flows. Qwen3.5-122B-A10B currently in evals for agentic coding.

dgunay · 2026-05-18T17:39:43 1779125983

For straightforward coding tasks I use gpt-5.3-codex on high or xhigh. Sometimes I try 5.5 but overall 5.3-codex is more than capable enough for most of my needs and quite a bit cheaper.

For more interactive/discussion/planning or orchestration stuff, I find myself going back and forth between Opus 4.7 and GPT 5.5. Still not sure which one I prefer.

cfunderburg · 2026-05-20T08:59:38 1779267578

I only use Anthropic models. Haven't touched GPT for a long time after I found myself swearing at them.

Opus 4.7, or 4.6 where it's still available at work: For spec'ing up projects or changes. The 15x multiplyer on Copilot means I rarely do this.

Sonnet 4.6 everywhere else. It rarely fails me.

david_d8912 · 2026-05-17T05:17:42 1778995062

GPT-5.5 + Opus-4.7 here. Codex for pure coding task with clear goal, claude code for the rest. Also combined with opencode to experiment new models.

fyi: I didn't have much lock on Deepseek v4 pro, with opencode + openrouter it's incredibly slow. How did op did it?

yossuf2000 · 2026-05-17T13:44:10 1779025450

GPT 5.5 main opus 4.7 frontend and when i need something different kimi 2.6 and GLM 5.1 when i don't have to pay on the task (using the opencode go subscription)

dennisjoseph · 2026-05-19T01:22:50 1779153770

Claude Sonnet for daily tasks, GPT 5.5 for reviewing Sonnet’s work, and Qwen for very specific tasks

late_night_fix · 2026-05-17T10:40:31 1779014431

GPT-5.5 daily.Opus for hard stuff.Deep seek for long context+ cheap iteration.Everthing else is routing and tool now.

teppeik · 2026-05-17T20:20:08 1779049208

By default, I use Sonnet 4.6, and if Sonnet 4.6 fails, I use Opus 4.7.

VishnuTech · 2026-05-17T11:13:07 1779016387

GPT-5.5 for daily ideas and brainstorming. It has become my daily go to.

dnnddidiej · 2026-05-17T13:17:29 1779023849

Opus 4.6. Does the job. Not much of an experimenter.

farwaabbas · 2026-05-17T07:24:39 1779002679

for idea using gpt3.5,claude for coding and also impressed by deepseek it large context window is really useful for long projects.

enceladus06 · 2026-05-17T17:05:19 1779037519

Opus 4.7 in Vscode via Claude Code.

cyanydeez · 2026-05-18T00:57:55 1779065875

qwencodernext. ask me about what it does and doesnt do.

s3lcx · 2026-05-20T20:25:10 1779308710

opus 4.7