Qwen3.6 mini GGUFs trim MTP grafting

// 45d ago OPENSOURCE RELEASE


This project strips Qwen3.6 GGUFs down to just the MTP tensors needed by buzz’s grafting script. The result is two small donor files, about 900MB for the 35A3B variant and 451MB for the 27B variant, in place of the full 38GB and 29GB downloads.

// ANALYSIS

Useful niche plumbing: it does not make inference easier by itself, but it removes the most annoying part of the MTP grafting workflow for people already managing local model libraries.

– The payoff is bandwidth and setup time, not new capability; these are compatibility shims for an existing conversion script.
– The author claims SHA256 parity against outputs made from the full models, which is the right validation for this kind of utility.
– Scope is narrow: only the two tested Qwen3.6 variants are covered, and the post itself warns that other model variants may fail.
– The artifact depends on an unstable MTP ecosystem, so treat it as a convenience layer rather than something to archive as canonical.
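
The SHA256 parity claim can be checked locally. The following is a minimal sketch, not the author's validation script: it assumes you have produced one grafted output from the full model and one from the mini donor file, and the file names in the usage comment are hypothetical placeholders.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in 1 MiB chunks
    so multi-gigabyte GGUFs never have to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def outputs_match(from_full, from_mini):
    """True when both grafted outputs are byte-identical."""
    return sha256_of(from_full) == sha256_of(from_mini)


# Usage (hypothetical file names, substitute your own outputs):
#   outputs_match("grafted-from-full.gguf", "grafted-from-mini.gguf")
```

Identical digests mean the two outputs are byte-for-byte the same, which is a stronger check than comparing file sizes or tensor counts.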

// TAGS

llm, open-weights, quantization, inference, open-source, qwen3.6-mtp-tensors-only

DISCOVERED

45d ago

2026-05-08

PUBLISHED

45d ago

2026-05-07

RELEVANCE

7/10

AUTHOR

AzerbaijanNyan
