OPEN_SOURCE ↗
REDDIT // 5h ago · TUTORIAL
Qwen3.6-35B-A3B guide targets 32GB Macs
This is a hands-on guide for running Qwen3.6-35B-A3B locally on an M2 MacBook Pro with 32GB RAM using llama.cpp and OpenCode. The post argues that quantization, a 128K context cap, and careful RAM discipline make a surprisingly capable local coding setup practical, if still fragile.
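The setup described boils down to a single llama.cpp server invocation. A minimal sketch is below; the GGUF and projector filenames are hypothetical placeholders, not taken from the post, and the exact quantization level the author used is not specified here:

```shell
# Serve a quantized Qwen3.6-35B-A3B GGUF through llama.cpp's built-in server.
# -c caps the context window at 128K tokens; -ngl 99 offloads all layers to
# the M2's unified-memory GPU; --mmproj loads the multimodal projector so
# the model can also accept screenshots.
llama-server \
  -m qwen3.6-35b-a3b-q4_k_m.gguf \
  --mmproj qwen3.6-35b-a3b-mmproj.gguf \
  -c 131072 \
  -ngl 99 \
  --port 8080
```

OpenCode (or any OpenAI-compatible client) can then point at `http://localhost:8080` as its model endpoint.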
// ANALYSIS
The real story here is not just “local AI on a Mac,” but how much system engineering it takes to make a frontier-ish coding model usable under tight memory pressure.
- The setup leans on a quantized GGUF checkpoint plus `mmproj` support, so the model can handle both code and screenshots through llama.cpp.
- Qwen3.6-35B-A3B is positioned as an efficient open-weight MoE model with 35B total and 3B active parameters, so the value proposition is density, not brute force.
- OpenCode matters because it makes local models feel like a real coding agent instead of a toy terminal chat.
- The author’s results are nuanced: solid on adapter-style, test-driven work, weaker on geometry-heavy UI debugging and large integration hunts.
- The tuning advice is practical: keep context high enough to avoid collapse, but leave enough headroom for unified memory, browser tabs, and the rest of the machine.
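The headroom tradeoff in the last point can be made concrete with a back-of-envelope KV-cache estimate. The layer and head dimensions below are illustrative placeholders, not Qwen3.6-35B-A3B's published config; the point is only that at 128K context, the cache alone can claim most of a 32 GB machine:

```python
def kv_cache_bytes(n_ctx: int, n_layers: int, n_kv_heads: int,
                   head_dim: int, bytes_per_elem: int = 2) -> int:
    """Rough KV-cache size: one K and one V tensor per layer, fp16 by default."""
    return 2 * n_layers * n_kv_heads * head_dim * n_ctx * bytes_per_elem

# Illustrative dimensions only (NOT the model's real config):
gib = kv_cache_bytes(n_ctx=131072, n_layers=48, n_kv_heads=8, head_dim=128) / 2**30
print(f"{gib:.1f} GiB")  # prints "24.0 GiB" for these placeholder dimensions
```

Quantized KV caches and grouped-query attention shrink this considerably, which is why the post's "RAM discipline" framing centers on the context cap.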
// TAGS
qwen3.6-35b-a3b · opencode · llama.cpp · ai-coding · agent · cli · self-hosted · multimodal
DISCOVERED
5h ago
2026-04-25
PUBLISHED
7h ago
2026-04-25
RELEVANCE
9/10
AUTHOR
boutell