Reddit weighs Claude planning, local code edits
A Reddit user in r/LocalLLaMA is asking about a hybrid workflow where Claude handles the planning and higher-level reasoning, then a local model on a single RTX 5090 takes over the actual coding work. The motivation is cost and usage-limit pressure on Claude, with the follow-up question being whether a Qwen3.5-35B-A3B Q4-class local model is dependable enough for implementation-only edits.
Hot take: this is less about “can a local model code?” and more about where the handoff boundary should be.
- –Claude is being used for what it is strongest at in this workflow: structuring the task, decomposing work, and keeping architectural context.
- –The local model question is really about edit reliability, patch discipline, and whether it can preserve the plan without drifting.
- –A 35B quantized model may be fine for contained edits, but it is more likely to need tighter prompts, smaller scopes, and stronger verification than Claude.
- –The post reflects a broader trend: people are looking for a premium-model planner plus local-model executor setup to reduce usage costs.
DISCOVERED
57d ago
2026-04-01
PUBLISHED
57d ago
2026-04-01
RELEVANCE
AUTHOR
ArugulaAnnual1765