OpenCode agent temps bypass llama.cpp defaults

// 90d agoTUTORIAL

OpenCode agent temps bypass llama.cpp defaults

OpenCode does support per-agent temperature settings, but its docs and the Reddit answer both point to a layering issue: the agent config sets request-level sampling, while llama.cpp still shows its own server-side defaults unless those are set separately. If you’re only looking at llama-server verbose startup output, that can look like OpenCode “didn’t work” even when the client is sending overrides.

// ANALYSIS

The likely bug here is config precedence, not broken sampling. OpenCode can specify `temperature` per agent, but llama.cpp also has its own defaults, so the server log alone is not proof that the agent setting was ignored.

–OpenCode’s docs explicitly support agent-scoped `temperature`, and say omitted values fall back to model-specific defaults.
–llama.cpp’s server has its own baseline sampler defaults, so startup/verbose output can be misleading if you’re expecting it to mirror per-request values.
–The practical fix is to set a sensible default in llama.cpp and then tune agent/subagent temperatures in OpenCode.
–For local models, the model card’s recommended sampler settings still matter more than generic “best” temperatures.
–This is a classic client-vs-server config mismatch: if you want to verify behavior, inspect the actual request payload, not just the server’s default log line.

// TAGS

opencodellama.cppagentclillmtemperatureself-hosted

DISCOVERED

90d ago

2026-04-17

PUBLISHED

90d ago

2026-04-17

RELEVANCE

7/ 10

AUTHOR

Careless-Marzipan-65

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

LAUNCH16m ago

PALO-AI launches agentic governance architecture

Fabrizio Degni has announced the developer preview of PALO-AI, a reference architecture that uses governance contracts to manage and audit the delegated authority of autonomous agents and collaborative teams. The preview includes sample JSON contracts, Rego policies, Model Context Protocol (MCP) tool definitions, and integration examples for n8n and Dify.

TUTORIAL45m ago

Microsoft "ML for Beginners" adds 50+ translations

Microsoft's popular 12-week open-source machine learning curriculum, ML for Beginners, has been updated to offer automated, always up-to-date translations into more than 50 languages, including Arabic, Hindi, and Swahili. This update aims to lower barriers to entry for aspiring machine learning practitioners globally by making the educational content accessible in their native languages.

LAUNCH1h ago

Fly.io launches Sprites, providing stateful and hardware-isolated Linux sandbox environments with fast copy-on-write checkpoint and restore capabilities.

Fly.io has introduced Sprites, which are stateful sandbox environments running in hardware-isolated AWS Firecracker microVMs designed for executing arbitrary, untrusted code or AI agents. Unlike traditional ephemeral serverless functions, Sprites retain their disk state between runs, utilizing a fast NVMe filesystem that continuously syncs to durable external storage. The platform features an ultra-fast copy-on-write checkpoint and restore system taking about 300ms, granular network egress policies using simple domain-level allowlists, and custom port forwarding for public or private service access. Sprites scale to zero and burst dynamically, meaning developers only pay for actual CPU, memory, and written storage usage.

OpenCode agent temps bypass llama.cpp defaults