BACK_TO_FEEDAICRIER_2
Raspberry Pi 5 runs 122B LLMs via SSD swap
OPEN_SOURCE ↗
REDDIT · REDDIT// 11d agoBENCHMARK RESULT

Raspberry Pi 5 runs 122B LLMs via SSD swap

A 16GB Raspberry Pi 5 successfully ran models up to 122B Qwen MoE using a 450GB SSD swap partition. Performance ranged from 11 tokens per second on sub-billion parameter models to 0.17 tokens per second for massive architectures.

// ANALYSIS

This technical stress test demonstrates that "local LLM" is a spectrum from interactive edge applications to slow-motion academic curiosities. Sub-4B parameter models like Qwen 0.8B and 2B maintain usable speeds of 5–11 t/s, while SSD-backed swap allows loading massive 122B models that would otherwise fail. Thermal data confirms a "wait-bound" cooling effect, as CPU temperatures drop while idling for I/O during heavy swap operations.

// TAGS
raspberry-pi-5llmbenchmarksqwen-3-5gemma-3llama-cppedge-aihardwareswap

DISCOVERED

11d ago

2026-03-31

PUBLISHED

11d ago

2026-03-31

RELEVANCE

6/ 10

AUTHOR

honuvo