OPEN_SOURCE ↗
REDDIT · REDDIT// 3h agoMODEL RELEASE
Gemma 4 MLX misses thinking mode
Gemma 4’s official release supports configurable thinking modes, but this Reddit thread says the LM Studio MLX build on Apple Silicon is not exposing that reasoning path. The likely culprit is chat-template/back-end wiring, not the base model weights.
// ANALYSIS
This looks more like an integration bug than a model limitation: the model can reason, but the MLX packaging may not be turning that capability on.
- –Google’s Gemma 4 model card says reasoning is built in and thinking is configurable, so the capability exists in the family itself.
- –The LM Studio Gemma 4 MLX template includes `enable_thinking` and `<|think|>` handling, which points to template/config plumbing as the place to check.
- –A matching Hugging Face discussion shows Gemma 4 can lose its thinking channel in certain template paths, so “missing reasoning” can be a rendering/prompting bug rather than a weights issue.
- –LM Studio’s changelog mentions updated Gemma 4 chat-template support and reasoning-related API fields, so upgrading LM Studio and verifying the active template is the first practical fix.
- –For document analysis workflows, preserving the thinking path matters more than raw throughput; speed gains are useful, but not if they disable the model behavior you actually need.
// TAGS
gemma-4mlxlm-studioreasoningllminferenceapple-silicon
DISCOVERED
3h ago
2026-04-28
PUBLISHED
5h ago
2026-04-28
RELEVANCE
9/ 10
AUTHOR
Labtester