LM Studio announces support for Google's newly released Gemma 4 12B encoder-free multimodal model

// 45d agoMODEL RELEASE

LM Studio announces support for Google's newly released Gemma 4 12B encoder-free multimodal model

LM Studio has announced immediate local support for Google's newly launched Gemma 4 12B model. Released by Google DeepMind on June 3, 2026, Gemma 4 12B is a unified, encoder-free multimodal model designed to run efficiently on consumer-grade hardware with at least 16GB of RAM or VRAM. By projecting visual and audio inputs directly into the LLM backbone rather than using separate encoders, the model dramatically reduces latency. LM Studio users can now download, run, and chat with Gemma 4 12B locally on Mac, Windows, and Linux via GGUF or MLX formats.

// ANALYSIS

Local multimodal AI is transitioning from a niche developer experiment to a mainstream desktop capability. Google's encoder-free architecture in Gemma 4 12B significantly reduces the resource overhead of vision and audio processing, making LM Studio the perfect consumer gateway for on-device agentic workflows without cloud dependencies.

* Encoder-free efficiency: Eliminating separate vision/audio projection layers reduces memory footprints and drastically lowers multimodal latency for local hardware.

* Democratizing agentic AI: The 12B parameter size fits comfortably within consumer-grade 16GB systems, bringing near-frontier intelligence directly to edge machines.

* Ecosystem speed: LM Studio’s rapid same-day support demonstrates the high agility of local inference communities compared to traditional enterprise release cycles.

// TAGS

lm-studiogemma-4gemma-4-12blocal-aimultimodalproduct-updategoogle-deepmind

DISCOVERED

45d ago

2026-06-04

PUBLISHED

45d ago

2026-06-04

RELEVANCE

8/ 10

AUTHOR

lmstudio

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

MODEL39m ago

Alibaba drops 2.4-trillion parameter Qwen3.8 MoE

Alibaba Cloud has unveiled Qwen3.8-Max-Preview, a 2.4-trillion-parameter Mixture-of-Experts (MoE) multimodal model available via its Token Plan and Qoder. The proprietary preview targets enterprise developers with significant upgrades in coding and analysis, with plans for a future open-source release.

OPEN SOURCE2h ago

Jellium Desktop launches as independent Jellyfin client

Jellium Desktop is an unofficial, Rust-based desktop client for Jellyfin that continues the development of the former official client under independent stewardship. The app integrates CEF and mpv to deliver a native, high-performance playback experience.

UPDATE3h ago

Think Agents plans ThinkOS beta next month

Think Agents has announced that the public beta of ThinkOS is on track to launch next month. The platform is a model-agnostic, private-data, and locally-hosted AI agent operating system designed for users to coordinate autonomous agents while ensuring complete data ownership.