OPEN_SOURCE
REDDIT · 7h ago · BENCHMARK RESULT
Gemma 4 MoE hits 12 t/s on Lunar Lake
Developers are successfully running Google's new Gemma 4 26B MoE models on Intel Lunar Lake integrated graphics via Vulkan. The hardware's on-package memory architecture delivers usable inference speeds of 7-12 tokens per second without requiring a discrete GPU.
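The post does not name the runtime, but the common community path for Vulkan inference on an Intel iGPU is llama.cpp compiled with its Vulkan backend. Below is a minimal sketch of such a run driven from Python; the binary path, model filename, and generation settings are assumptions, not details from the post:

```python
# Minimal sketch: launch a llama.cpp Vulkan-backend run from Python.
# Assumes llama.cpp was built with -DGGML_VULKAN=ON and that a 4-bit
# GGUF of the model exists at the (hypothetical) path below.
import subprocess

MODEL = "gemma-4-26b-moe-q4_k_m.gguf"  # hypothetical filename

cmd = [
    "./build/bin/llama-cli",
    "-m", MODEL,
    "-ngl", "99",   # offload all layers to the Vulkan device (the Xe2 iGPU)
    "-n", "256",    # number of tokens to generate
    "-p", "Explain mixture-of-experts routing in two sentences.",
]

# llama-cli prints a tokens/second summary at the end of the run,
# which is where figures like the 7-12 t/s reported here come from.
subprocess.run(cmd, check=True)
```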
// ANALYSIS
Intel's Lunar Lake architecture is quietly becoming a powerhouse for local LLM inference, proving that high memory bandwidth can offset the lack of a discrete GPU.
- The Gemma 4 26B Mixture of Experts (MoE) model hits a hardware sweet spot by activating only ~4B parameters per token
- Lunar Lake's 32GB of on-package LPDDR5X memory eliminates the traditional CPU-to-GPU bus latency, providing the crucial bandwidth needed for large models
- While native OpenVINO optimization currently struggles with 20B+ models on the NPU, community-compiled Vulkan bridges are effectively leveraging the Xe2 iGPU
- Achieving 7-12 tokens per second for a 26B model on a thin-and-light laptop significantly lowers the hardware barrier for local AI development (a rough bandwidth sanity check is sketched below)
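Those throughput figures are consistent with a simple memory-bandwidth estimate. A quick check, assuming ~136 GB/s of LPDDR5X-8533 bandwidth and a 4-bit quantization at roughly 0.57 bytes per parameter (neither figure is stated in the post):

```python
# Back-of-the-envelope check on the reported 7-12 tokens/s.
# Assumptions (not from the post): ~136 GB/s of on-package LPDDR5X-8533
# bandwidth on Lunar Lake, and a 4-bit quant (~0.57 bytes/parameter) so each
# decoded token streams the ~4B active parameters from memory once.
BANDWIDTH_BPS = 136e9      # assumed memory bandwidth, bytes/s
ACTIVE_PARAMS = 4e9        # ~4B parameters activated per token (MoE)
BYTES_PER_PARAM = 0.57     # rough Q4_K-class quantization footprint

bytes_per_token = ACTIVE_PARAMS * BYTES_PER_PARAM   # ~2.3 GB read per token
ceiling_tps = BANDWIDTH_BPS / bytes_per_token       # memory-bound ceiling, ~60 t/s

for reported in (7.0, 12.0):
    print(f"{reported:.0f} t/s is {reported / ceiling_tps:.0%} of the bandwidth ceiling")
# 7-12 t/s works out to roughly 12-20% of the theoretical ceiling, a plausible
# fraction for an early community Vulkan path on an iGPU.
```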
// TAGS
gemma-4 · llm · inference · edge-ai · open-weights
DISCOVERED
7h ago
2026-04-12
PUBLISHED
10h ago
2026-04-12
RELEVANCE
8/10
AUTHOR
No-Key8555