REDDIT // 2h ago · OPEN-SOURCE RELEASE

Qwen launches Qwen3.6-35B-A3B MoE model

Qwen3.6-35B-A3B is presented as an open-source sparse Mixture-of-Experts (MoE) model with 35B total parameters, of which only 3B are active per token at inference time, aimed at delivering strong efficiency without giving up capability. The launch highlights agentic coding performance, multimodal perception and reasoning, and support for both thinking and non-thinking modes over multimodal inputs, with access through Qwen Studio and Hugging Face.
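If the Hugging Face availability claim holds, loading should follow the standard transformers flow. A minimal sketch, assuming a repo id of Qwen/Qwen3.6-35B-A3B (inferred from the model name, not confirmed by the post):

```python
# Hypothetical repo id inferred from the model name; verify on huggingface.co.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "Qwen/Qwen3.6-35B-A3B"  # assumption, not confirmed by the post

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype="auto",   # use the dtype stored in the checkpoint
    device_map="auto",    # shard across available GPUs
)

messages = [{"role": "user", "content": "Write a binary search in Python."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

Nothing here is Qwen-specific beyond the repo id; if the release ships custom modeling code, trust_remote_code=True may also be needed.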

// ANALYSIS

Hot take: if the benchmark claims hold in real workflows, this is the kind of efficiency jump that makes large-model capabilities feel far more deployable.

  • The 3B-active/35B-total setup is the main story: it promises much lower serving cost than dense models while keeping a much larger expert pool behind the router (see the toy routing sketch after this list).
  • The multimodal plus coding angle broadens the appeal beyond pure chat, especially for agentic and developer tooling use cases.
  • Apache 2.0 matters as much as the model itself for adoption, since it removes licensing friction for commercial and local deployments.
  • The risk is that “on par with 10x larger models” is launch-language; real-world agent reliability and multimodal robustness still need independent validation.
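
For readers new to MoE, the efficiency claim rests on top-k routing: each token is dispatched to only a few experts, so most of the 35B parameters sit idle on any given forward pass. A toy sketch in plain PyTorch; the expert count, k, and layer sizes are invented for illustration and say nothing about Qwen3.6's actual architecture:

```python
# Toy top-k MoE layer: shows why only a fraction of the expert pool runs
# per token. All dimensions below are made up for the demo.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyMoE(nn.Module):
    def __init__(self, d_model=64, n_experts=16, k=2):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(n_experts)
        )
        self.k = k

    def forward(self, x):  # x: (tokens, d_model)
        weights = F.softmax(self.router(x), dim=-1)        # (tokens, n_experts)
        topk_w, topk_idx = weights.topk(self.k, dim=-1)    # keep k experts/token
        topk_w = topk_w / topk_w.sum(dim=-1, keepdim=True) # renormalize gates
        out = torch.zeros_like(x)
        for slot in range(self.k):
            idx = topk_idx[:, slot]
            for e in idx.unique():                         # run only chosen experts
                mask = idx == e
                out[mask] += topk_w[mask, slot:slot + 1] * self.experts[int(e)](x[mask])
        return out

moe = ToyMoE()
tokens = torch.randn(8, 64)
print(moe(tokens).shape)  # torch.Size([8, 64]); only 2 of 16 experts ran per token
```

At 3B active out of 35B total, roughly 9% of the weights do work on each token, which is where the serving-cost argument comes from.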
// TAGS
qwen · qwen3.6 · moe · multimodal · open-source · apache-2.0 · llm · coding · agentic

DISCOVERED: 2h ago (2026-04-16)

PUBLISHED: 8h ago (2026-04-16)

RELEVANCE: 9/10

AUTHOR: Infinite-pheonix