Google releases multimodal Gemma 4-12B-it model
Google has announced Gemma 4-12B-it, a new open multimodal model in the Gemma 4 family. The model natively handles and generates multiple types of inputs and outputs, which makes it easier for developers to integrate advanced AI capabilities.
Google's new 12B open-weights model is a major step forward for local development, combining efficiency with native multimodal support.
- Gemma 4-12B-it runs efficiently on local hardware with approximately 16GB of VRAM.
- The encoder-free architecture integrates vision and audio natively into the LLM backbone, improving latency and performance.
- Released under the Apache 2.0 license, it encourages open ecosystem adoption and integration.
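
To put the roughly 16GB VRAM figure in context, the following is a minimal back-of-the-envelope sketch of how much memory the weights alone would need at different precisions. It assumes "12B" means about 12 billion parameters, and it ignores the KV cache, activations, and runtime overhead. The function name `weight_memory_gib` and the listed bit widths are illustrative choices, not details from the announcement.

```python
def weight_memory_gib(num_params: float, bits_per_param: int) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 2**30


# Assumption: "12B" is read as roughly 12 billion parameters.
PARAMS = 12e9

# Illustrative precisions; the announcement does not say which one
# the ~16GB figure refers to.
for bits in (16, 8, 4):
    gib = weight_memory_gib(PARAMS, bits)
    print(f"{bits}-bit weights: {gib:.1f} GiB")
```

Under these assumptions, 16-bit weights alone would exceed 16GB while 8-bit weights would fit, so the quoted figure may refer to a reduced-precision or quantized configuration. The source does not specify this.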
DISCOVERED: 2026-06-05
PUBLISHED: 2026-06-05
AUTHOR: finguru1980