RX 6900 XT Benchmarks Split ROCm, Vulkan

// 90d ago · BENCHMARK RESULT

A Reddit user shared quick llama.cpp benchmarks on an AMD Radeon RX 6900 XT, comparing a ROCm 6.4.2 build against the latest Vulkan backend. The results are split: Vulkan often leads on token generation throughput, while ROCm can be faster at prompt processing for some workloads, most clearly Qwen 3.5 4B Q8_0. For Gemma 4 E2B Q4_K, the winner shifts with ubatch size, which suggests that backend performance on this card depends on the workload rather than favoring one stack across the board.
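For readers who want to run a similar comparison, a sweep of this kind could be driven roughly as follows. This is a minimal sketch, not the poster's actual command: it assumes llama.cpp's `llama-bench` tool is on the PATH, and the model path and ubatch values are hypothetical placeholders, since the post's exact settings are not given here. The helper name `llama_bench_cmd` is also made up for illustration.

```python
import shlex


def llama_bench_cmd(model_path, ubatches, binary="llama-bench",
                    n_prompt=512, n_gen=128):
    """Build an argv list for one llama-bench sweep over ubatch sizes.

    n_prompt=512 and n_gen=128 correspond to the pp512 and tg128 tests.
    """
    if not ubatches:
        raise ValueError("need at least one ubatch size")
    return [
        binary,
        "-m", model_path,                           # GGUF model file
        "-p", str(n_prompt),                        # prompt-processing test
        "-n", str(n_gen),                           # token-generation test
        "-ub", ",".join(str(u) for u in ubatches),  # comma list = sweep
        "-o", "json",                               # machine-readable output
    ]


if __name__ == "__main__":
    # Hypothetical model path and ubatch values, for illustration only.
    cmd = llama_bench_cmd("models/example.gguf", [128, 256, 512])
    print(shlex.join(cmd))
```

The same command would be run once against each build (ROCm and Vulkan), keeping the model file and settings identical so that only the backend differs.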

// ANALYSIS

ROCm is not a blanket win or loss here: it looks stronger for prompt processing on the Qwen run, while Vulkan's token-generation lead shows up on Qwen but not on Gemma.

– Gemma 4 E2B Q4_K is mixed: Vulkan wins pp512 (512-token prompt processing) at larger ubatches, but ROCm holds the edge on tg128 (128-token generation).
– Qwen 3.5 4B Q8_0 is clearer: ROCm is substantially faster on pp512 across all tested ubatches, while Vulkan is faster on tg128.
– The post is a quick local benchmark, not a controlled long-context study, so it is best read as a practical snapshot for AMD RDNA2 users.
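The per-test comparison above can be sketched as a small helper that groups throughput readings by model, test, and ubatch, then reports which backend is ahead and by what factor. The record layout (`backend`, `model`, `test`, `ubatch`, `tps`) and the function name `pick_winners` are assumptions made for illustration, not llama-bench's own output schema, and no figures from the post are reproduced.

```python
from collections import defaultdict


def pick_winners(rows):
    """Return {(model, test, ubatch): (fastest_backend, speedup_ratio)}.

    rows: iterable of dicts with keys backend, model, test, ubatch, tps
    (tokens per second). Groups with fewer than two backends are skipped.
    """
    grouped = defaultdict(dict)
    for r in rows:
        key = (r["model"], r["test"], r["ubatch"])
        grouped[key][r["backend"]] = r["tps"]

    winners = {}
    for key, by_backend in grouped.items():
        if len(by_backend) < 2:
            continue  # nothing to compare against
        ranked = sorted(by_backend.items(), key=lambda kv: kv[1], reverse=True)
        (best, best_tps), (_, second_tps) = ranked[0], ranked[1]
        # Ratio of the fastest backend over the runner-up.
        ratio = best_tps / second_tps if second_tps > 0 else float("inf")
        winners[key] = (best, ratio)
    return winners
```

Keying on ubatch as well as test keeps a result like the Gemma one visible, where the prompt-processing winner changes with ubatch size instead of being averaged away.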

// TAGS

amd, radeon rx 6900 xt, rocm, vulkan, llama.cpp, gemma 4, qwen 3.5, local llm, gpu benchmark

DISCOVERED

90d ago

2026-04-28

PUBLISHED

90d ago

2026-04-28

RELEVANCE

8/10

AUTHOR

grumd
