OPEN_SOURCE ↗
REDDIT // 5h ago · INFRASTRUCTURE
Qwen, Gemma vie for M4 coding
A LocalLLaMA thread asks which local models make sense for heavy coding on a 24GB MacBook M4, with commenters pointing to Qwen and Gemma-class models while warning that cloud models still win for serious coding work. The practical advice centers on smaller quantized models, MLX-aware runtimes, and tools like Ollama or LM Studio.
// ANALYSIS
The useful takeaway is not "run the biggest model you can squeeze in"; it is that 24GB Apple Silicon is a capable local inference box, but still a compromise for coding agents.
- Qwen gets the strongest community nod, with 14B-class models fitting comfortably and 30B-ish quantized models possible but tighter on memory and context
- Gemma is framed as the safer fit on 24GB, while larger Qwen variants may trade too much speed or headroom for quality
- For actual heavy coding, commenters still favor Claude, Codex, or other paid remote models, using local LLMs for drafts, private tasks, or backend automation
- MLX support matters on Mac, because memory bandwidth and Apple Silicon-native loaders can make the difference between usable and frustrating
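The "14B fits comfortably, 30B is tight" framing follows from simple quantization arithmetic. A minimal sketch (the helper name is ours, and it deliberately ignores KV cache, runtime overhead, and macOS's cap on GPU working-set memory, all of which consume several more GB of the 24GB unified pool):

```python
# Back-of-envelope weight-memory math behind the 24GB advice.
# Assumption: weights dominate; KV cache and runtime overhead are extra.

def quantized_weight_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate in-memory size of quantized weights, in GB."""
    return params_billions * bits_per_weight / 8

if __name__ == "__main__":
    for params, bits in [(14, 4), (30, 4), (14, 8)]:
        print(f"{params}B @ {bits}-bit ~ {quantized_weight_gb(params, bits):.1f} GB")
```

At 4-bit, a 14B model needs roughly 7 GB for weights, leaving generous room for context; a 30B model needs roughly 15 GB, which is why commenters call it possible but cramped once the KV cache grows.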
// TAGS
qwen · gemma · llm · ai-coding · inference · edge-ai · open-weights
DISCOVERED
5h ago
2026-04-21
PUBLISHED
6h ago
2026-04-21
RELEVANCE
6/10
AUTHOR
Extra-Perception2408