Google's Gemma 4 12B model faces criticism for slow local performance and poor multimodal generation on Apple Silicon.

// BENCHMARK RESULT · 45d ago

A developer tested Google's Gemma 4 12B model on a MacBook Pro M5 Max with 128GB of unified memory and found its performance disappointing for a model of its size. The model reached only 44 tokens per second, and the developer judged its generation quality subpar, specifically criticizing a lava lamp image it produced. The developer recommended that others stick with Qwen 3.6 27B or 35B instead.

// ANALYSIS

Google's new encoder-free multimodal architecture may face performance penalties when running locally on consumer hardware.

* At 44 tokens/second on an M5 Max with 128GB of unified memory, the model is slow for its 12B parameter size and falls short of developer expectations for local execution.

* The poor quality of the generated lava lamp image suggests that the model's multimodal capabilities are not yet competitive with other options.

* The recommendation of Qwen 3.6 27B or 35B highlights Qwen's strong position in the open-weights space, particularly for local deployment.
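To put the 44 tokens/second figure in practical terms, the following minimal sketch shows how decode throughput is commonly computed and what it implies for response latency. The function names `throughput` and `seconds_for` are hypothetical, and the source does not say how the developer measured the number, so treat this as an illustration rather than the benchmark's actual method.

```python
def throughput(token_count: int, elapsed_seconds: float) -> float:
    """Return decode throughput in tokens per second."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return token_count / elapsed_seconds


def seconds_for(token_count: int, tokens_per_second: float) -> float:
    """Return the time needed to generate token_count tokens at a given rate."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return token_count / tokens_per_second


if __name__ == "__main__":
    # Hypothetical run: 440 tokens generated in 10 seconds gives the reported rate.
    rate = throughput(440, 10.0)
    print(f"{rate:.1f} tokens/second")

    # At that rate, a 1,000-token response takes roughly 23 seconds.
    print(f"{seconds_for(1000, rate):.1f} seconds for 1,000 tokens")
```

At the reported rate, a 1,000-token answer takes a little under 23 seconds, which helps explain why the developer considered the speed disappointing for interactive local use.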

DISCOVERED: 2026-06-05 (45d ago)

PUBLISHED: 2026-06-05 (45d ago)

RELEVANCE: 6/10

AUTHOR: bridgemindai
