OPEN_SOURCE
REDDIT · 17d ago · BENCHMARK RESULT
Qwen3.5-35B-A3B powers flawless 27-step local agent chain
A Reddit user reports that Qwen3.5-35B-A3B completed a 27-call local video workflow end to end, from Whisper transcription to subtitle burning, without a single error or manual intervention. The entire run stayed on a Lenovo P53 using llama.cpp and whisper.cpp, with no cloud APIs, making it a strong real-world demo for a sparse MoE model on mid-range hardware.
// ANALYSIS
MoE is starting to look like a real advantage, not just an architecture footnote. The interesting part here is less that Qwen answered well and more that it held state across a long, messy tool chain and finished the job locally.
- 27 sequential tool calls with verification is a better agent test than a single prompt-response benchmark.
- The official model card says 35B total parameters and 3B activated, which is exactly the kind of sparsity that makes local deployment plausible.
- Fully local execution with llama.cpp and whisper.cpp removes cloud latency, cost, and privacy friction.
- Video-to-subtitles is a good stress test because it mixes planning, file I/O, transcription, and post-processing.
- Ten minutes end to end is slow, but if it stays reliable, that's a tradeoff many local workflows will happily take.
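The "sequential tool calls with verification" pattern can be sketched as a plain loop: run each tool, verify its output, and halt the chain on the first failure instead of passing bad state downstream. This is a minimal illustration, not the poster's code; the step names mirror the video-to-subtitles workflow, but the bodies are stubs standing in for real tool invocations (ffmpeg, whisper.cpp).

```python
# Hypothetical sketch of a sequential tool chain with per-step verification.
# Each stub stands in for a real local tool call (ffmpeg, whisper.cpp, etc.).

def extract_audio(state):
    state["audio"] = "talk.wav"          # stub: would shell out to ffmpeg
    return state

def transcribe(state):
    state["transcript"] = "hello world"  # stub: would call whisper.cpp
    return state

def burn_subtitles(state):
    state["output"] = "talk.subbed.mp4"  # stub: would call ffmpeg again
    return state

# Pair every tool with a verifier so a silent failure cannot propagate.
CHAIN = [
    (extract_audio,  lambda s: s.get("audio", "").endswith(".wav")),
    (transcribe,     lambda s: bool(s.get("transcript"))),
    (burn_subtitles, lambda s: s.get("output", "").endswith(".mp4")),
]

def run_chain(state):
    for step, verify in CHAIN:
        state = step(state)
        if not verify(state):
            raise RuntimeError(f"verification failed after {step.__name__}")
    return state

result = run_chain({"video": "talk.mp4"})
print(result["output"])
```

A real 27-step agent run replaces the stubs with LLM-planned tool invocations, but the control structure, executing one verified step at a time, is what makes a long chain finish reliably.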
// TAGS
qwen3.5-35b-a3b · llm · agent · self-hosted · open-weights · automation · inference · benchmark
DISCOVERED
2026-03-25 (17d ago)
PUBLISHED
2026-03-25 (18d ago)
RELEVANCE
9/10
AUTHOR
cride20