Qwen3.6 heretic v2 keeps MTP, slashes refusals
LLMFan46 released an uncensored fork of Qwen3.6-27B that preserves all 15 native MTP heads while shipping safetensors, GGUF, and NVFP4 builds. The repo claims a 0.0021 KL divergence from the base model and a 6/100 refusal rate, positioning the fork as a local-serving variant aimed at keeping Qwen3.6’s speed and flexibility intact.
The interesting part here is not the “uncensored” label; it’s that the fork tries to keep speculative decoding and quantization-friendly deployment at the same time. That is the difference between a novelty model and something people will actually run.
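Why retained MTP heads matter: they let the model draft several tokens ahead, which the main forward pass then verifies, so most steps emit multiple tokens for one target-model pass. A minimal sketch of the standard greedy verification step (illustrative toy, not the repo’s code; real stacks verify against sampled distributions, not exact matches):

```python
def verify_draft(draft_tokens, target_tokens):
    """Greedy speculative-decoding verification (toy version).

    Accept the longest prefix of the draft that matches what the target
    model would emit. At the first mismatch, the target's own token
    replaces the bad draft token; if every draft token matches, the
    target contributes one bonus token for free.
    """
    accepted = []
    for d, t in zip(draft_tokens, target_tokens):
        if d == t:
            accepted.append(d)          # draft token confirmed
        else:
            accepted.append(t)          # target's correction, stop here
            break
    else:
        # all draft tokens matched; append the target's next token
        if len(target_tokens) > len(draft_tokens):
            accepted.append(target_tokens[len(draft_tokens)])
    return accepted


# A good draft (e.g. from well-trained MTP heads) yields several tokens
# per verification pass; a bad one degrades to one token per pass.
print(verify_draft([5, 7, 9], [5, 7, 2]))   # mismatch at position 2
print(verify_draft([5, 7], [5, 7, 4]))      # full acceptance + bonus token
```

The acceptance rate of the draft heads is exactly what post-processing can silently damage, which is why "all 15 MTP heads retained" is a throughput claim, not just a checkbox.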
- The release says all 15 MTP heads are retained, which matters for throughput on local inference stacks that can exploit speculative decoding
- Multiple packaging formats widen the audience: safetensors for standard serving, GGUF for llama.cpp-style local use, and NVFP4 for smaller-footprint GPU deployment
- The reported 0.0021 KL divergence and 6/100 refusal rate are attractive, but they are self-reported model-card metrics, not independent evals
- This is a derivative of Qwen3.6-27B, so the underlying appeal is still Qwen’s strong multimodal and agentic-coding base rather than a brand-new model family
- For AI devs, the practical question is whether preserved MTP outweighs the usual quality hit from aggressive post-processing; that is the real tradeoff here
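On the self-reported 0.0021 figure: a KL divergence that low says the fork’s token distributions barely moved from the base model’s. The metric itself is easy to reproduce locally if you want to verify the claim (minimal sketch; assumes per-token KL in nats averaged over a prompt set, which the model card does not specify):

```python
import math

def kl_divergence(p, q):
    """KL(p || q) = sum_i p_i * log(p_i / q_i), in nats.

    p: next-token distribution from the base model
    q: next-token distribution from the modified fork
    Terms with p_i == 0 contribute nothing by convention.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Identical distributions diverge by zero; a fully shifted one-hot
# distribution against uniform over 2 tokens diverges by log(2).
print(kl_divergence([0.5, 0.5], [0.5, 0.5]))
print(kl_divergence([1.0, 0.0], [0.5, 0.5]))
```

Averaging this per-token quantity over held-out prompts for both models is a cheap independent check before trusting the model card.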
DISCOVERED: 2026-05-07
PUBLISHED: 2026-05-07
AUTHOR: LLMFan46