OPEN_SOURCE
REDDIT // INFRASTRUCTURE
Ryzen AI Max+ 395 hits long-context wall
A Bosgame M5 128GB user on AMD's Strix Halo platform reports that Claude Code-style document agents feel faster on Vulkan than on ROCm, even though ROCm should win prompt processing on paper. The real pain point is long-context work: performance drops sharply once prompts push past roughly 50K tokens.
// ANALYSIS
Strix Halo is powerful enough for local agents, but this thread shows the real bottleneck is backend behavior under long context, not model size. AMD's own docs now position Ryzen AI Max+ 395 for MCP-heavy workflows, yet the software stack still needs tuning before it feels effortless.
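One reason long context dominates here is that the KV cache grows linearly with prompt length and competes with model weights for the same UMA pool. A rough sizing sketch; the model shape used below (80 layers, 8 grouped-query KV heads, head dim 128, fp16) is an illustrative assumption, not a configuration from the thread:

```python
def kv_cache_bytes(n_tokens: int, n_layers: int, n_kv_heads: int,
                   head_dim: int, bytes_per_elem: int = 2) -> int:
    """Bytes needed for the K and V caches at a given context length."""
    # Two tensors (K and V), one per layer, per KV head, per token.
    return 2 * n_layers * n_kv_heads * head_dim * n_tokens * bytes_per_elem

# Hypothetical 70B-class shape: 80 layers, 8 KV heads (GQA), head_dim 128.
gib = kv_cache_bytes(50_000, 80, 8, 128) / 2**30
print(f"{gib:.1f} GiB")  # → 15.3 GiB at 50K tokens
```

At 50K tokens that is on the order of 15 GiB of cache for this shape, on top of the weights, which is why where that cache lands (GPUVM/GTT-mapped system RAM vs. anything else) becomes a first-order knob.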
- AMD's ROCm docs note that the supported llama.cpp fork differs from upstream ggml-org builds, so the choice of Docker image can change behavior materially.
- Official Strix Halo guidance frames GPU memory as GPUVM/GTT-mapped system RAM, making UMA and KV-cache placement a first-order performance knob.
- Community reports on Strix Halo suggest ROCm can lead in prompt-processing tests, while Vulkan may feel smoother once generation and very long contexts are included.
- For document-centric agents: batch ingestion, reuse the KV cache, and benchmark at real context sizes rather than with small-prompt benchmarks.
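The "reuse the KV cache" advice amounts to structuring requests so consecutive prompts share a long common token prefix, since a server that still holds the cache for that prefix only needs to prefill the new suffix. A minimal sketch of the prefix-matching idea, with made-up token IDs:

```python
def reusable_prefix(cached: list[int], prompt: list[int]) -> int:
    """Number of leading tokens the new prompt shares with the cached one.

    A server holding the KV cache for `cached` only needs to prefill
    len(prompt) - n tokens for the new prompt.
    """
    n = 0
    for a, b in zip(cached, prompt):
        if a != b:
            break
        n += 1
    return n

# A document agent re-sends the same system/document prefix (here 4
# tokens) and appends a new question.
cached = [1, 17, 42, 99, 7, 7]
prompt = [1, 17, 42, 99, 3, 5, 8]
n = reusable_prefix(cached, prompt)
print(f"reuse {n} tokens, prefill {len(prompt) - n}")  # reuse 4, prefill 3
```

This is why putting the document before the question, rather than after it, matters for agent latency at 50K-token contexts: an edited question invalidates nothing, while an edited document invalidates everything after it.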
// TAGS
amd-ryzen-ai-max-plus-395 · llm · agent · inference · self-hosted · mcp · gpu
DISCOVERED
2026-03-24
PUBLISHED
2026-03-23
RELEVANCE
7/10
AUTHOR
Intelligent-Form6624