OPEN_SOURCE
REDDIT // 2h ago // MODEL RELEASE
Qwen3.6 abliterated variant lands on HF
Wangzhang published an abliterated Qwen3.6-35B-A3B checkpoint on Hugging Face, tuned around MoE-specific refusal behavior rather than dense-model attention paths. The repo claims 7/100 refusals under a stricter LLM-judge eval, with low KL drift from the base model.
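The core "abliteration" operation described here is typically directional ablation: estimate a refusal direction in activation space, then orthogonalize the targeted weight matrices (here, o_proj, MLP down_proj, and per-expert slices) against it so the model can no longer write along that direction. The repo's exact method isn't shown; this is a minimal numpy sketch of the standard projection step, with a toy random matrix standing in for a real projection weight.

```python
import numpy as np

def ablate_direction(W: np.ndarray, r: np.ndarray) -> np.ndarray:
    """Remove the component of W's output that lies along direction r.

    W maps x -> W @ x with shape (d_out, d_in); r has shape (d_out,).
    After ablation, r.T @ (W' @ x) == 0 for every input x:
        W' = W - r (r^T W)   with r unit-norm.
    """
    r = r / np.linalg.norm(r)
    return W - np.outer(r, r @ W)

# Toy example (d_model = 8); in practice W would be an o_proj,
# down_proj, or expert slice, and r a refusal direction estimated
# from contrastive (harmful vs. harmless) prompt activations.
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 8))
r = rng.standard_normal(8)

W_abl = ablate_direction(W, r)
r_unit = r / np.linalg.norm(r)
assert np.allclose(r_unit @ W_abl, 0.0)  # no output component along r
```

Leaving Q/K/V untouched, as the card claims, amounts to applying this projection only to matrices that write into the residual stream, on the theory that the MoE refusal signal is carried by expert outputs rather than attention score computation.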
// ANALYSIS
This is a useful reminder that “uncensoring” MoE models is not just a bigger LoRA problem; refusal behavior can live in expert routing and expert-specific projections.
- The method targets o_proj, MLP down_proj, and expert slices while explicitly leaving Q/K/V untouched, which matches the claim that the safety signal is routed through MoE experts.
- The 7/100 refusal result is more credible than many flashy abliterated model cards because it uses longer generations and a judge model instead of simple keyword checks.
- Router biasing toward selected "safety experts" is a strong intervention, but it also raises the risk of brittle behavior outside the exact eval set.
- Low KL divergence suggests the base model's general behavior is preserved reasonably well, which matters more than raw uncensoring if the model still needs to be usable.
- This sits in the same broader trend as other Qwen abliterations: community demand is clearly for local, less-filtered variants, especially on MoE checkpoints where the intervention surface is different.
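The "low KL drift" claim is usually measured per token: compare the base and abliterated models' next-token distributions on neutral text and average KL(base || ablated). The repo's eval harness isn't specified, so this is just an illustrative sketch of the metric itself over raw logit vectors.

```python
import numpy as np

def next_token_kl(logits_p: np.ndarray, logits_q: np.ndarray) -> float:
    """KL(P || Q) between two next-token distributions given as logits.

    Softmax is computed in a numerically stable way (max-shifted).
    A value near 0 means the ablated model's predictions barely moved.
    """
    p = np.exp(logits_p - logits_p.max()); p /= p.sum()
    q = np.exp(logits_q - logits_q.max()); q /= q.sum()
    return float(np.sum(p * (np.log(p) - np.log(q))))

# Hypothetical logits for one position of a tiny 5-token vocabulary.
base    = np.array([3.0, 1.5, 0.2, -0.5, -2.0])
ablated = np.array([2.9, 1.6, 0.2, -0.4, -2.1])  # slight drift

assert next_token_kl(base, base) < 1e-9      # identical models -> 0
drift = next_token_kl(base, ablated)
assert 0.0 < drift < 0.05                    # small positive drift
```

In a real eval this would be averaged over many positions of held-out text; a low mean KL supports the card's point that general behavior survives the intervention, but it says nothing about behavior on the refusal-adjacent prompts the router biasing specifically targets.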
// TAGS
llm · open-source · qwen3.6-35b-a3b-abliterated
DISCOVERED
2h ago
2026-04-17
PUBLISHED
2h ago
2026-04-17
RELEVANCE
9/10
AUTHOR
Free_Change5638