OPEN_SOURCE
REDDIT · 14d ago · INFRASTRUCTURE
Qwen3-235B-A22B runs on X13, 4 A100s
A LocalLLaMA user shows off an X13 server with dual Xeon Silver 4415 CPUs, 1 TB of RAM, and four Nvidia A100s, apparently built to run Qwen3-235B-A22B. It’s a useful snapshot of what “local” looks like once you move into frontier open-weight models.
// ANALYSIS
Open-weight does not mean lightweight, and this rig is a good reminder that frontier self-hosting still looks a lot like a mini-datacenter. The good news is that Qwen’s software stack is mature enough that this kind of deployment is realistic rather than purely aspirational.
- Qwen3-235B-A22B is a 235B-parameter MoE model with 22B active parameters, so the name understates how much inference machinery is still involved.
- Qwen’s official docs show the model being served with multi-GPU tensor parallelism, including 8-way BF16 and 4-way FP8/quantized setups, which makes four A100s a credible target.
- The 1 TB of RAM and dual Xeons likely matter as much as the GPUs for KV cache, host-side sharding, and keeping long-context inference stable.
- Apache 2.0 licensing plus support in vLLM, SGLang, llama.cpp, Ollama, LM Studio, and TensorRT-LLM lowers software friction, but hardware remains the real moat.
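The hardware math behind these bullets is worth making explicit. A minimal sketch of the back-of-envelope VRAM arithmetic, assuming 80 GB A100s and approximate bytes-per-parameter figures (the exact footprint depends on the quantization scheme and KV cache settings):

```python
# Back-of-envelope VRAM sizing for Qwen3-235B-A22B on 4x A100 80GB.
# Weight storage scales with the TOTAL parameter count (235B), even though
# only ~22B parameters are active per token in the MoE forward pass.

TOTAL_PARAMS = 235e9
GPU_VRAM_GB = 80
NUM_GPUS = 4

def weights_gb(bytes_per_param: float) -> float:
    """Approximate weight footprint in GB (1 GB = 1e9 bytes)."""
    return TOTAL_PARAMS * bytes_per_param / 1e9

budget = GPU_VRAM_GB * NUM_GPUS        # 320 GB across the four GPUs
bf16 = weights_gb(2.0)                 # ~470 GB: does not fit
fp8  = weights_gb(1.0)                 # ~235 GB: fits, little KV-cache headroom
int4 = weights_gb(0.5)                 # ~118 GB: comfortable headroom

print(f"budget={budget:.0f} GB  bf16={bf16:.0f}  fp8={fp8:.0f}  int4={int4:.0f}")
```

This is why a 4-way quantized setup is the credible target here rather than BF16 (which needs an 8-GPU split), and note that A100s lack Hopper-class native FP8 tensor cores, so a weight-only quantized variant is the likelier fit. In vLLM such a deployment would be launched along the lines of `vllm serve Qwen/Qwen3-235B-A22B --tensor-parallel-size 4`, with the quantization flags depending on the chosen checkpoint.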
// TAGS
qwen3-235b-a22b · llm · gpu · inference · self-hosted · open-weights
DISCOVERED
2026-03-29
PUBLISHED
2026-03-29
RELEVANCE
8/10
AUTHOR
AutomaticBedroom3870