OPEN_SOURCE ↗
REDDIT // 2d ago · DISCUSSION
Ollama Gemma 4 vision budget question
A Reddit user asks how to set the visual token budget for Gemma 4:31B inside Ollama. It’s a bare help request with no answer in-thread, but it points to a real multimodal tuning knob rather than a model bug.
// ANALYSIS
The hot take: multimodal local-model UX is still too opaque, and users are being forced to discover important quality-vs-speed controls by trial, error, and Reddit.
- Ollama’s Gemma 4 docs already expose visual token budgets from 70 to 1120, so the setting exists even if the path to it is non-obvious.
- Lower budgets favor faster captioning or video workflows; higher budgets are the right fit for OCR, document parsing, and small-text reading.
- In practice, this likely belongs in the model config or request payload, not as a hidden runtime surprise.
- Questions like this are a good signal that local model wrappers need better defaults and clearer multimodal controls.
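If the setting does land in the request payload, it would plausibly ride along in the `options` object that Ollama's `/api/generate` endpoint accepts for runtime parameters. A minimal sketch of what that request could look like follows; note that the thread has no confirmed answer, so the key name `num_visual_tokens` is a hypothetical placeholder, not a documented Ollama option, and the model tag is taken from the question as asked.

```python
import json

# Hedged sketch: build a /api/generate request body with a per-request
# visual token budget. "num_visual_tokens" is an ASSUMED placeholder key,
# not a documented Ollama parameter; the model tag comes from the thread.
payload = {
    "model": "gemma4:31b",
    "prompt": "Transcribe all text visible in this image.",
    "images": ["<base64-encoded image data>"],  # Ollama takes base64 strings here
    "options": {
        "num_visual_tokens": 1120,  # hypothetical key: max budget, suited to OCR
    },
}

body = json.dumps(payload)
print(body)

# Sending it would require a local Ollama server, e.g.:
#   import urllib.request
#   req = urllib.request.Request(
#       "http://localhost:11434/api/generate",
#       data=body.encode(),
#       headers={"Content-Type": "application/json"},
#   )
#   urllib.request.urlopen(req)
```

For a captioning or video pipeline the same sketch would drop the budget toward the low end (70), trading small-text fidelity for speed, which matches the quality-vs-speed framing in the bullets above.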
// TAGS
ollama · gemma-4 · multimodal · llm · self-hosted · inference · cli
DISCOVERED
2026-04-09
PUBLISHED
2026-04-09
RELEVANCE
6 / 10
AUTHOR
notjustaanotherguy