OpenClaw misses oMLX prompt cache
OpenClaw users are reporting zero cached tokens against a local oMLX backend even when the same model caches correctly through direct `/v1/chat/completions` calls and Hermes. The likely culprit is OpenClaw’s request shaping for local proxy routes, not oMLX or the Qwen model itself.
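A quick way to confirm that split is to hit the backend directly and watch the usage counters. The sketch below is a minimal check, not oMLX tooling: the port, model id, and the `usage.prompt_tokens_details.cached_tokens` field location are all assumptions, since OpenAI-compatible servers vary in what they report. It sends the same byte-identical request twice and prints the cached-token count:

```python
# Minimal sketch: confirm oMLX itself caches on repeated, byte-identical
# prompts. URL, model id, and usage field names are assumptions; adjust
# them to what your oMLX build actually serves and reports.
import requests

URL = "http://localhost:8080/v1/chat/completions"  # assumed local oMLX address
BODY = {
    "model": "qwen",  # assumed model id; use whatever `/v1/models` lists
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Say hi."},
    ],
    "max_tokens": 8,
    "stream": False,
}

for attempt in (1, 2):
    usage = requests.post(URL, json=BODY, timeout=60).json().get("usage", {})
    details = usage.get("prompt_tokens_details") or {}
    # OpenAI-style servers report reused prefix tokens here; names vary by server.
    print(f"call {attempt}: prompt_tokens={usage.get('prompt_tokens')} "
          f"cached_tokens={details.get('cached_tokens', 0)}")
```

If the second call reports cached tokens here but OpenClaw's traffic never does, the miss is being introduced on OpenClaw's side of the wire.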
This reads less like a model/server bug and more like an agent-runtime mismatch: OpenClaw appears to be changing the prompt or omitting cache-relevant hints in ways Hermes does not.
- OpenClaw docs say local `/v1` backends are treated as proxy-style OpenAI-compatible routes and do not get native OpenAI-only shaping, including prompt-cache hints.
- The user’s config sets `compat.supportsPromptCacheKey: true`, but that only matters if OpenClaw actually forwards the key on the chosen transport path; the capture sketch after this list makes that visible on the wire.
- The earlier 2026.2.15 local-cache regression suggests OpenClaw is still sensitive to small prompt-layout changes that can blow prefix caching on local models (illustrated after this list).
- The fastest debug path is to diff the exact request bodies from Hermes vs OpenClaw, especially system prompt ordering, tool schemas, and any `prompt_cache_key` or Responses-specific fields.
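The capture step can be as small as a pass-through proxy. This is an illustrative sketch only, not part of OpenClaw or oMLX: the ports, upstream address, and file naming are assumptions, and it handles non-streaming requests only, so disable streaming in both clients while testing. It dumps each request body to disk and flags whether `prompt_cache_key` survived:

```python
# Pass-through capture proxy (sketch): point Hermes and OpenClaw at this
# port instead of oMLX, then diff the dumped JSON files afterwards.
import json, time
from http.server import BaseHTTPRequestHandler, HTTPServer
import requests

UPSTREAM = "http://localhost:8080"  # assumed real oMLX address

class Capture(BaseHTTPRequestHandler):
    def do_POST(self):
        raw = self.rfile.read(int(self.headers.get("Content-Length", 0)))
        body = json.loads(raw)
        # Dump each request body to its own file, key-sorted for clean diffs.
        fname = f"req-{time.time_ns()}.json"
        with open(fname, "w") as f:
            json.dump(body, f, indent=2, sort_keys=True)
        # Flag the cache-relevant field the config claims to enable.
        print(f"{fname}: prompt_cache_key present: {'prompt_cache_key' in body}")
        # Forward unchanged so the client still gets a real completion.
        # (Auth headers are not forwarded; fine for a local, unauthenticated oMLX.)
        resp = requests.post(UPSTREAM + self.path, data=raw,
                             headers={"Content-Type": "application/json"},
                             timeout=300)
        self.send_response(resp.status_code)
        self.send_header("Content-Type", "application/json")
        self.end_headers()
        self.wfile.write(resp.content)

HTTPServer(("127.0.0.1", 9999), Capture).serve_forever()
```

Pointing Hermes and OpenClaw at port 9999 in turn and diffing the dumped files should show directly whether OpenClaw reorders the system prompt, serializes tool schemas differently, or drops the cache key.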
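As for why a small layout change is enough to zero out the counter: prefix caches reuse only the longest common leading span of the serialized prompt, so a reordering near the top invalidates everything after it. A toy illustration with hypothetical serializations (character-level here for brevity; real caches compare token sequences):

```python
def common_prefix_len(a: str, b: str) -> int:
    """Length of the shared leading span; everything past it is a cache miss."""
    n = 0
    for ca, cb in zip(a, b):
        if ca != cb:
            break
        n += 1
    return n

# Hypothetical serializations: same content, different section order.
hermes_prompt = "SYSTEM: You are helpful.\nTOOLS: [...]\nUSER: hi"
openclaw_prompt = "TOOLS: [...]\nSYSTEM: You are helpful.\nUSER: hi"
print(common_prefix_len(hermes_prompt, openclaw_prompt))  # 0 -> fully cold cache
```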