OPEN_SOURCE
REDDIT · 21d ago · INFRASTRUCTURE
Supermicro V100 rig eyes local LLMs
A Reddit user is eyeing a dirt-cheap 8x Tesla V100 setup for local LLMs, pairing used datacenter GPUs with an older Supermicro chassis and custom water cooling. The big catch is the chassis choice: the official 8x V100 NVLink platform is the SYS-4028GR-TVRT, while TXRT/TRT point at different GPU layouts and generations.
// ANALYSIS
This is a clever salvage-build idea, but it is much more likely to become a fun hardware project than a clean cost/perf king.
- Supermicro’s own docs list the SYS-4028GR-TVRT as the 8x Tesla V100 SXM2, 300 GB/s NVLink box; TXRT is the P100-era sibling, and TRT is a PCIe-GPU chassis.
- If you end up with PCIe V100s instead of SXM2 modules, the whole NVLink premise changes, so the exact GPU form factor matters as much as the price.
- 128 GB of aggregate VRAM sounds huge, but it is still sharded memory across eight cards, so model parallelism, interconnect efficiency, and software support will decide real-world speed.
- Custom water cooling can make the thermals work, but it also adds leak risk, maintenance headaches, and another failure layer on top of already-old enterprise hardware.
- The value case is strongest if you want a tinkering lab and can tolerate rough edges; if you want a low-friction local inference box, simpler modern hardware is usually the saner buy.
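The sharded-memory point above can be made concrete with a back-of-the-envelope check. A minimal sketch (hypothetical helper, assuming 16 GB V100s, evenly sharded weights, and a flat ~20% per-GPU overhead reserve for activations and KV cache; real frameworks budget memory differently):

```python
def fits_sharded(params_b: float, bytes_per_param: float, num_gpus: int,
                 vram_per_gpu_gb: float, overhead_frac: float = 0.2) -> bool:
    """Rough check: do evenly sharded model weights fit on each GPU?"""
    weights_gb = params_b * bytes_per_param      # 1B params * N bytes ≈ N GB
    per_gpu_gb = weights_gb / num_gpus           # even tensor-parallel split
    usable_gb = vram_per_gpu_gb * (1 - overhead_frac)  # reserve for KV cache etc.
    return per_gpu_gb <= usable_gb

# 70B model in FP16 across 8x 16 GB V100s: 140 GB of weights, 17.5 GB/GPU
print(fits_sharded(70, 2, 8, 16))   # False -- does not fit
# Same model quantized to ~1 byte/param: 70 GB, 8.75 GB/GPU
print(fits_sharded(70, 1, 8, 16))   # True -- plausible, bandwidth permitting
```

Even when the capacity math works out, tokens still cross the interconnect every layer, which is why the SXM2/NVLink-versus-PCIe distinction dominates the build's real-world value.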
// TAGS
supermicro-sys-4028gr-tvrt · tesla-v100 · llm · gpu · inference · self-hosted · nvlink
DISCOVERED
2026-03-21
PUBLISHED
2026-03-21
RELEVANCE
8/10
AUTHOR
lethalratpoison