OPEN_SOURCE
REDDIT // 24d ago · BENCHMARK RESULT
JANG Quantization Beats MLX on MiniMax
JANG is a mixed-precision quantization and runtime stack for Apple Silicon that aims to deliver GGUF-like efficiency in MLX without sacrificing Metal speed. The post claims it sharply improves quantized model quality on MiniMax-M2.5 and Qwen3.5 MoE models, especially at 2-bit.
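For context on what bit-width means here, the sketch below round-trips a weight tensor through generic symmetric k-bit quantization. This is an illustrative assumption, not JANG's actual algorithm (JANG is mixed-precision and the post does not describe its internals); it only shows why uniform 2-bit is so lossy compared with 4-bit.

```python
# Illustrative only: generic symmetric k-bit quantize/dequantize in NumPy.
# NOT JANG's method; it demonstrates the precision gap the post is about.
import numpy as np

def quantize_symmetric(w: np.ndarray, bits: int):
    """Map float weights to signed ints in [-2**(bits-1), 2**(bits-1)-1]."""
    qmax = 2 ** (bits - 1) - 1
    scale = np.abs(w).max() / qmax if np.abs(w).max() > 0 else 1.0
    q = np.clip(np.round(w / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal((64, 64)).astype(np.float32)

q2, s2 = quantize_symmetric(w, bits=2)   # only 4 levels: very coarse
q4, s4 = quantize_symmetric(w, bits=4)   # 16 levels: much finer
err2 = float(np.abs(w - dequantize(q2, s2)).mean())
err4 = float(np.abs(w - dequantize(q4, s4)).mean())
assert err4 < err2  # more bits -> lower reconstruction error
```

Mixed-precision schemes like the one JANG claims to use spend extra bits only on sensitive layers, which is how they can beat uniform low-bit quantization at a similar average bit-width.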
// ANALYSIS
This looks like a real fix for a specific MLX failure mode: not just smaller weights, but better answers from the same local Mac hardware budget. If the numbers hold up outside the author’s harness, JANG could be one of the most useful Apple Silicon inference tools in the local-LLM stack.
- The headline result is stark: JANG_2S scores 74% on MiniMax-M2.5 while MLX 4-bit/3-bit/2-bit cluster around 25%, which is basically random on that test set.
- The repo frames JANG as “the GGUF equivalent for MLX,” but with models staying in GPU memory at full Metal speed, so this is both a format and a runtime story.
- The practical upside is biggest on huge local models: the post cites Qwen3.5-122B at 79% with 38 GB versus MLX 2-bit at 56.5% with 36 GB.
- The benchmark story is promising but still self-reported, so third-party replication will matter before anyone treats this as settled evidence.
- Even so, the product fills a clear gap for Mac users who want better coherence than uniform MLX quantization gives them.
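The cited footprints are at least plausible on a back-of-envelope check. Under the assumption that the sizes are dominated by quantized weights, 122B parameters at a uniform 2 bits works out to roughly 30.5 GB, and the reported 36 to 38 GB leaves a believable margin for scales, higher-precision layers, and runtime overhead:

```python
# Sanity check of the post's memory numbers (assumption: the cited sizes
# are mostly quantized weights plus quantization/runtime overhead).
def weight_gb(params_billions: float, bits_per_weight: float) -> float:
    """Pure weight storage in decimal GB at a given average bit-width."""
    return params_billions * 1e9 * bits_per_weight / 8 / 1e9

base = weight_gb(122, 2.0)
print(f"122B params at 2 bits: {base:.1f} GB of raw weights")
# vs. the post's 38 GB (JANG) and 36 GB (MLX 2-bit) totals
```

The ~6 to 8 GB gap between raw 2-bit weights and the reported totals is consistent with per-group scale factors and a few layers kept at higher precision, which is exactly what a mixed-precision format would spend its budget on.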
// TAGS
jang · llm-inference · benchmark · open-source · edge-ai
DISCOVERED
2026-03-18 (24d ago)
PUBLISHED
2026-03-18 (24d ago)
RELEVANCE
8/10
AUTHOR
HealthyCommunicat