Qwen3.6 heretic variant keeps MTPs

// 2h ago · MODEL RELEASE

This is a decensored Qwen3.6-35B-A3B derivative that preserves the model's native MTP (multi-token prediction) tensors across multiple release formats. The author ships safetensors, GGUF, NVFP4, and GPTQ-Int4 builds and reports a KL divergence of 0.0015 with 10/100 refusals.
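For readers unfamiliar with the KL figure, here is a minimal sketch of how a per-token KL divergence between a base model and a modified variant could be computed. The post does not say how the author measured it, so the direction KL(base || variant), the averaging over token positions, and the toy logits below are all assumptions for illustration.

```python
import math

def softmax(logits):
    """Convert raw logits into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q):
    """KL(P || Q) in nats for two discrete distributions over the same vocabulary."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def mean_token_kl(base_logits, variant_logits):
    """Average KL(base || variant) over token positions (assumed aggregation)."""
    kls = [
        kl_divergence(softmax(b), softmax(v))
        for b, v in zip(base_logits, variant_logits)
    ]
    return sum(kls) / len(kls)

# Toy logits for two token positions over a 3-token vocabulary (made-up values).
base = [[2.0, 1.0, 0.1], [0.5, 0.5, 3.0]]
variant = [[2.0, 1.05, 0.1], [0.5, 0.45, 3.0]]

print(f"mean KL: {mean_token_kl(base, variant):.6f}")
```

A value near zero means the variant's next-token distributions are almost identical to the base model's on the prompts measured; it says nothing about prompts outside that set.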

// ANALYSIS

This is mostly a distribution and alignment tweak, not a new base model, but it matters for people trying to run Qwen3.6 locally with fewer refusals and full MTP behavior intact.

  • The model card says the original 19 MTP tensors are preserved; the GGUF count looks different because `gate_up_proj` is split in that format.
  • The format spread is the real value here: safetensors for Transformers users, GGUF for local runners, and NVFP4/GPTQ for lower-memory deployments.
  • The reported KL divergence of 0.0015 suggests the ablation stayed close to the base model while changing refusal behavior materially.
  • Because this is an uncensored fork, users should validate quality and safety on their own workloads before treating it as a drop-in replacement.
  • The benchmark framing is useful, but it is still an author-provided eval, so independent verification would matter for serious adoption.
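The tensor-count point in the first bullet can be illustrated with a small sketch: when a fused `gate_up_proj` tensor is stored as two separate tensors, the number of tensors goes up even though no weights are added or lost. The tensor names, the `mtp.` prefix, and the `gate_proj`/`up_proj` split names below are hypothetical and are not taken from the model card or from any converter.

```python
def split_fused_gate_up(names):
    """Return tensor names as they would look if each fused gate_up_proj
    were stored as two tensors (hypothetical naming)."""
    out = []
    for name in names:
        if "gate_up_proj" in name:
            out.append(name.replace("gate_up_proj", "gate_proj"))
            out.append(name.replace("gate_up_proj", "up_proj"))
        else:
            out.append(name)
    return out

def count_with_prefix(names, prefix="mtp."):
    """Count tensors whose name starts with the given prefix."""
    return sum(1 for n in names if n.startswith(prefix))

# Hypothetical tensor names, for illustration only.
fused_layout = [
    "mtp.layers.0.mlp.experts.gate_up_proj",
    "mtp.layers.0.mlp.experts.down_proj",
    "mtp.norm.weight",
    "model.layers.0.self_attn.q_proj.weight",
]

split_layout = split_fused_gate_up(fused_layout)
print(count_with_prefix(fused_layout), count_with_prefix(split_layout))
```

Under these assumptions the same weights show up as a different tensor count in each layout, which is why comparing raw counts across formats can mislead.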
// TAGS
llm, open-weights, moe, quantization, evaluation, open-source, qwen3.6-35b-a3b-uncensored-heretic-native-mtp-preserved

DISCOVERED

2h ago

2026-05-09

PUBLISHED

5h ago

2026-05-09

RELEVANCE

9/10

AUTHOR

LLMFan46