OPEN_SOURCE ↗
REDDIT · REDDIT// 4h agoBENCHMARK RESULT
Intel Arc Pro B70 hits 2.2k tps
Early benchmarks of Intel's Battlemage-based Arc Pro B70 show impressive 2,200 tps ingestion speeds for Qwen 3.5 27B, though driver and software stability for local LLMs remain a significant hurdle.
// ANALYSIS
Intel's Xe2 architecture is finally showing its teeth in AI, but the "Intel Tax" of messy software setup is still very real for early adopters.
- –Ingestion performance (2,200 tps) is highly competitive, but generation speeds (~10 tps) suggest a massive bottleneck in the current vLLM/SYCL implementations for Battlemage.
- –vLLM with the `intel/llm-scaler-vllm` docker image is the only reliable path for high performance, as native `llama.cpp` backends (OpenVINO, Vulkan) are still throwing critical tensor errors.
- –The 32GB VRAM on the Pro variants makes 27B+ models viable locally, but the requirement for Ubuntu 26.04 or Fedora 43 for proper kernel support is a steep barrier for casual users.
- –Sub-par generation speeds relative to ingestion highlight that Intel's XMX engines are underutilized in current autoregressive sampling loops compared to NVIDIA's Tensor Cores.
// TAGS
intel-arc-pro-b70gpullmvllmqwenbattlemagebenchmark
DISCOVERED
4h ago
2026-04-15
PUBLISHED
5h ago
2026-04-14
RELEVANCE
8/ 10
AUTHOR
Gesha24