Qwen3.6 MLX port trims refusals on Macs
This is an MLX release of an abliterated Qwen3.6-35B-A3B variant, built from a Heretic source checkpoint and quantized with a layer-aware 4/6-bit scheme for local Apple Silicon deployment. The model card says it keeps the base model's reasoning and instruction-following profile while removing refusal behavior at the weight level, and that it was validated with short chat, reasoning, and code smoke tests.
This is less about benchmark theatrics and more about a practical local-chat stack for Apple Silicon users who want a large MoE model that runs fast on Macs. The MLX packaging is the main value, and the abliterated positioning matters if you want fewer refusals. The layer-aware 4/6-bit quantization suggests more care than a flat 4-bit pass, but the validation is light enough that the "best chatbot" claim should still be treated as anecdotal.
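For readers who want to try the local-deployment claim, a minimal sketch of loading an MLX checkpoint with the `mlx-lm` CLI. The repo path and prompt below are placeholders, not the release's actual checkpoint name, which the post does not give here:

```shell
# Sketch only: the model path is a placeholder for the actual
# Hugging Face repo or local directory holding the MLX checkpoint.
pip install mlx-lm

mlx_lm.generate \
  --model <hf-repo-or-local-path> \
  --prompt "Summarize the tradeoffs of 4-bit quantization." \
  --max-tokens 256
```

On Apple Silicon, `mlx-lm` runs the quantized weights on the GPU via Metal; no separate runtime configuration is required beyond installing the package.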
DISCOVERED: 2026-05-08
PUBLISHED: 2026-05-08
AUTHOR: eclipsegum