OPEN_SOURCE ↗
REDDIT · REDDIT// 11d agoBENCHMARK RESULT
Raspberry Pi 5 runs 122B LLMs via SSD swap
A 16GB Raspberry Pi 5 successfully ran models up to 122B Qwen MoE using a 450GB SSD swap partition. Performance ranged from 11 tokens per second on sub-billion parameter models to 0.17 tokens per second for massive architectures.
// ANALYSIS
This technical stress test demonstrates that "local LLM" is a spectrum from interactive edge applications to slow-motion academic curiosities. Sub-4B parameter models like Qwen 0.8B and 2B maintain usable speeds of 5–11 t/s, while SSD-backed swap allows loading massive 122B models that would otherwise fail. Thermal data confirms a "wait-bound" cooling effect, as CPU temperatures drop while idling for I/O during heavy swap operations.
// TAGS
raspberry-pi-5llmbenchmarksqwen-3-5gemma-3llama-cppedge-aihardwareswap
DISCOVERED
11d ago
2026-03-31
PUBLISHED
11d ago
2026-03-31
RELEVANCE
6/ 10
AUTHOR
honuvo