M5 Max owners test on-battery inference
The Reddit thread asks whether M5 Max laptops can sustain useful local LLM inference on battery without the speed and battery-life tradeoffs Strix Halo users run into. It’s a practical check on whether Apple’s efficiency advantage translates into real portable AI work, not just benchmark marketing.
If you care about running local models unplugged, this is the right question: sustained tok/s per watt matters more than peak throughput.
- –The comparison is really about usable performance while capped by battery and thermal limits, not charger-attached benchmark numbers.
- –Model size, quantization, and runtime choice will move the needle a lot; MLX, Ollama, and how well the stack uses Apple’s accelerators all matter.
- –If M5 Max stays responsive on battery without collapsing under throttling, it becomes a stronger portable inference platform than many x86/AMD laptops.
- –The thread reflects where local AI hardware buying is headed: developers now judge laptops by unplugged inference behavior, not just specs on paper.
DISCOVERED
45d ago
2026-04-17
PUBLISHED
45d ago
2026-04-16
RELEVANCE
AUTHOR
spaceman_