Local Qwen duo beats vision for web tasks

// 85d ago · BENCHMARK RESULT

A Reddit demo shows a local planner-executor setup (Qwen 8B + Qwen 4B) completing browser shopping flows by replanning one action at a time from compact semantic DOM snapshots instead of screenshots. The reported result is a full cart flow on unfamiliar sites with about 15K total tokens, with modal detection/dismissal cited as a major reliability boost.
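To make the "compact semantic DOM snapshot" idea concrete, here is a minimal sketch of how interactive elements might be flattened into a small text table for the model. The `Element` fields, the pipe-delimited layout, and the `render_snapshot` name are all assumptions for illustration; the demo's actual snapshot format is not described in the post.

```python
from dataclasses import dataclass


@dataclass
class Element:
    """Hypothetical record for one interactive DOM node."""
    id: int
    role: str
    text: str


def render_snapshot(elements, max_text=40):
    """Render elements as a pipe-delimited table, one row per element.

    Whitespace is collapsed and labels are truncated so the prompt stays small.
    """
    rows = ["id|role|text"]
    for el in elements:
        text = " ".join(el.text.split())[:max_text]
        rows.append(f"{el.id}|{el.role}|{text}")
    return "\n".join(rows)


if __name__ == "__main__":
    page = [Element(1, "button", "Add  to cart"), Element(2, "link", "Checkout")]
    print(render_snapshot(page))
```

A table like this costs a few tokens per element, which is one plausible reason a text snapshot can be far cheaper than a screenshot passed to a vision model.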

// ANALYSIS

Stepwise replanning looks like the practical unlock for small local browser agents, because it trades brittle long-horizon guessing for tight state-feedback loops.

– Replanning per DOM snapshot reduces cascading failures when real page state diverges from an initial plan.
– Semantic tables shift the executor into a low-entropy “pick an element ID” task that smaller models can handle.
– Modal/overlay cleanup is doing hidden heavy lifting and should be treated as a core control loop, not a side heuristic.
– The token gap versus vision-heavy flows suggests a clear cost/latency advantage for local-first automation stacks.
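The loop described in the points above can be sketched roughly as follows. This is a toy simulation under stated assumptions, not the demo's code: `Page`, `snapshot`, `dismiss_modals`, `plan_next_step`, and `pick_element` are hypothetical names, and the two rule-based functions stand in for the planner and executor models.

```python
from dataclasses import dataclass, field


@dataclass
class Page:
    """Toy page state standing in for a real browser session (hypothetical)."""
    modal_open: bool = True
    cart: list = field(default_factory=list)
    checked_out: bool = False
    elements: dict = field(default_factory=lambda: {1: "Add to cart", 2: "Checkout"})


def snapshot(page):
    """Compact semantic view of the page: a few state facts plus id -> label."""
    return {
        "cart_count": len(page.cart),
        "checked_out": page.checked_out,
        "elements": dict(page.elements),
    }


def dismiss_modals(page):
    """Close any blocking overlay; return True if the page changed."""
    if page.modal_open:
        page.modal_open = False
        return True
    return False


def plan_next_step(goal, view):
    """Stub for the planner model: emit ONE intent from the current state."""
    if view["checked_out"]:
        return None  # goal reached, nothing left to do
    return "Add to cart" if view["cart_count"] == 0 else "Checkout"


def pick_element(intent, view):
    """Stub for the executor model: choose the element ID matching the intent."""
    return next(i for i, label in view["elements"].items() if label == intent)


def click(page, element_id):
    """Apply the action to the toy page."""
    if element_id == 1:
        page.cart.append("item")
    elif element_id == 2:
        page.checked_out = True


def run(goal, page, max_steps=5):
    """Replan one action at a time, clearing overlays before every step."""
    log = []
    for _ in range(max_steps):
        if dismiss_modals(page):
            log.append("dismiss_modal")
            continue  # page changed, so take a fresh snapshot next iteration
        view = snapshot(page)
        intent = plan_next_step(goal, view)
        if intent is None:
            break
        element_id = pick_element(intent, view)
        click(page, element_id)
        log.append(f"click:{element_id}")
    return log


if __name__ == "__main__":
    print(run("buy one item", Page()))
```

Running modal cleanup at the top of every iteration, rather than as an occasional fallback, reflects the point that overlay handling belongs in the core loop. The planner never commits to more than one step, so a surprise in page state only costs a single replanning call.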

// TAGS

predicate-sdk-playground, qwen, agent, automation, computer-use, open-source, self-hosted

DISCOVERED

85d ago

2026-03-17

PUBLISHED

85d ago

2026-03-17

RELEVANCE

8/10

AUTHOR

Aggressive_Bed7113
