RX 6900 XT Benchmarks Split ROCm, Vulkan
REDDIT · 2h ago · BENCHMARK RESULT

A Reddit user shared quick llama.cpp benchmarks on an AMD Radeon RX 6900 XT, comparing a build against ROCm 6.4.2 with a build using the latest Vulkan backend. The results are split: Vulkan often leads on token generation throughput, while ROCm can be faster at prompt processing for some workloads, most clearly Qwen 3.5 4B Q8_0. For Gemma 4 E2B Q4_K, the winner shifts with ubatch (micro-batch) size, which reinforces that backend performance here depends on the workload rather than universally favoring one stack.

// ANALYSIS

ROCm is neither a blanket win nor a blanket loss here: it looks stronger for prompt processing on the Qwen run, while Vulkan is more consistently better at token generation.

  • Gemma 4 E2B Q4_K is mixed: Vulkan wins pp512 at larger ubatches, but ROCm holds the edge on tg128.
  • Qwen 3.5 4B Q8_0 is clearer: ROCm is substantially faster on pp512 across all tested ubatches, while Vulkan is faster on tg128.
  • The post is a quick local benchmark, not a controlled long-context study, so it is best read as a practical snapshot for AMD RDNA2 users.
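
A comparison like the one summarized above can be reproduced with a small sweep. The sketch below is an illustration under assumptions, not the poster's script: it assumes the `llama-bench` tool from llama.cpp is run once per backend build with its `-p`, `-n`, `-ub`, and `-o` options, and that the output has already been parsed into records with the hypothetical keys `backend`, `test`, `ubatch`, and `tokens_per_sec` (these are not llama-bench's own field names).

```python
from collections import defaultdict


def build_bench_command(binary, model_path, ubatch):
    """Return an argv list for one llama-bench run at a given ubatch size.

    -p 512 requests the pp512 (prompt processing) test and -n 128 the
    tg128 (token generation) test. The binary and model paths are
    placeholders supplied by the caller.
    """
    return [binary, "-m", model_path, "-p", "512", "-n", "128",
            "-ub", str(ubatch), "-o", "json"]


def winners(records):
    """Map (test, ubatch) -> (faster backend, speedup over the runner-up).

    Each record is a dict with the hypothetical keys described above.
    Groups that lack two backends, or whose runner-up has non-positive
    throughput, are skipped.
    """
    grouped = defaultdict(dict)
    for r in records:
        grouped[(r["test"], r["ubatch"])][r["backend"]] = r["tokens_per_sec"]

    result = {}
    for key, by_backend in grouped.items():
        if len(by_backend) < 2:
            continue
        ranked = sorted(by_backend.items(), key=lambda kv: kv[1], reverse=True)
        (best, best_ts), (_, next_ts) = ranked[0], ranked[1]
        if next_ts <= 0:
            continue
        result[key] = (best, best_ts / next_ts)
    return result
```

Running the same command list against the ROCm build and the Vulkan build, then passing the combined records to `winners`, yields one verdict per test and ubatch size, which is the granularity at which the bullets above disagree.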
// TAGS
amd · radeon rx 6900 xt · rocm · vulkan · llama.cpp · gemma 4 · qwen 3.5 · local llm · gpu benchmark

DISCOVERED

2h ago

2026-04-28

PUBLISHED

4h ago

2026-04-28

RELEVANCE

8/10

AUTHOR

grumd