Gemma 4 MLX misses thinking mode

// 94d ago · MODEL RELEASE

Gemma 4’s official release supports configurable thinking modes, but this Reddit thread says the LM Studio MLX build on Apple Silicon is not exposing that reasoning path. The likely culprit is chat-template/back-end wiring, not the base model weights.

// ANALYSIS

This looks more like an integration bug than a model limitation: the model can reason, but the MLX packaging may not be turning that capability on.

– Google’s Gemma 4 model card says reasoning is built in and thinking is configurable, so the capability exists in the family itself.
– The LM Studio Gemma 4 MLX template includes `enable_thinking` and `<|think|>` handling, which points to template/config plumbing as the place to check.
– A matching Hugging Face discussion shows Gemma 4 can lose its thinking channel in certain template paths, so “missing reasoning” can be a rendering/prompting bug rather than a weights issue.
– LM Studio’s changelog mentions updated Gemma 4 chat-template support and reasoning-related API fields, so upgrading LM Studio and verifying the active template is the first practical fix.
– For document analysis workflows, preserving the thinking path matters more than raw throughput; speed gains are useful, but not if they disable the model behavior you actually need.
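
One way to act on the "verify the active template" advice is to scan the template text for the two markers named above. The sketch below is illustrative only: the marker strings come from this article, while `inspect_template`, `TemplateReport`, and the sample template fragment are hypothetical names and content, not LM Studio or Gemma 4 internals. Finding the markers only suggests the template has a thinking branch; it does not prove the back end actually enables it.

```python
from dataclasses import dataclass

# Marker strings taken from the analysis above; a real template may use others.
THINKING_MARKERS = ("enable_thinking", "<|think|>")


@dataclass
class TemplateReport:
    """Records which thinking-related markers appear in a template."""

    found: dict

    @property
    def mentions_all_markers(self) -> bool:
        # True only when every marker appears somewhere in the template text.
        return all(self.found.values())


def inspect_template(template_text: str) -> TemplateReport:
    """Plain substring check; this does not render or validate the template."""
    return TemplateReport({m: m in template_text for m in THINKING_MARKERS})


# Hypothetical fragment for illustration, NOT the actual Gemma 4 template.
SAMPLE_TEMPLATE = (
    "{% if enable_thinking %}<|think|>{% endif %}"
    "{{ messages[0]['content'] }}"
)

if __name__ == "__main__":
    report = inspect_template(SAMPLE_TEMPLATE)
    for marker, present in report.found.items():
        print(f"{marker!r}: {'present' if present else 'missing'}")
```

If either marker is missing from the template your build actually loads, the template is the first thing to update or replace before suspecting the weights.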

// TAGS

gemma-4, mlx, lm-studio, reasoning, llm, inference, apple-silicon

DISCOVERED

94d ago

2026-04-28

PUBLISHED

94d ago

2026-04-28

RELEVANCE

9/10

AUTHOR

Labtester
