OPEN_SOURCE
REDDIT · 6h ago · INFRASTRUCTURE
Ollama on Radeon 780M hits limits
This Reddit post asks whether a Ryzen 7 8845HS mini PC with Radeon 780M and 32GB shared memory can run Ollama and Open WebUI alongside homelab workloads. It focuses on realistic model sizes, ROCm performance, and the practical ceiling once Proxmox services share system resources.
// ANALYSIS
The hot take: this is a sensible homelab experiment, but the ceiling is set less by peak benchmark numbers than by memory pressure, ROCm maturity, and how much contention the rest of the stack creates.
- The post is useful because it frames the real decision: not “can it run?” but “which model sizes stay pleasant once the machine is multitasking?”
- The 7B throughput claim is the least stable part of the discussion; actual tok/s will vary heavily with ROCm version, kernel, quantization, context length, and whether the iGPU is also serving other workloads.
- The most actionable part is the memory question: 32GB of shared RAM sounds generous, but Proxmox, containers, the iGPU frame-buffer carve-out, and device overhead can push the practical model ceiling noticeably below the theoretical maximum.
- For this class of hardware, sub-14B models are the likely sweet spot for mixed coding and writing use, with 7B-9B models probably offering the best balance of responsiveness and quality.
- The Proxmox LXC/ROCm angle matters because passing an AMD iGPU through cleanly is often more about driver compatibility and container plumbing than raw hardware capability.
- As a product story, this is really about Ollama as the local inference layer for a privacy-first, no-API-cost assistant setup.
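The memory-ceiling point above can be turned into rough arithmetic. This is a sketch, not a measurement: the bytes-per-parameter figure approximates a 4-bit quantization with overhead, the per-model layer/dimension values are hypothetical stand-ins for common 7B-14B models, and the 12 GB host overhead for Proxmox plus containers plus the iGPU carve-out is an assumption for illustration.

```python
# Rough sketch of the "practical ceiling" arithmetic for 32GB shared memory.
# Assumptions: ~0.55 bytes/parameter for a 4-bit quant including overhead,
# fp16 KV cache with no grouped-query attention, 8K context.

def model_footprint_gb(params_b, n_layers, d_model, ctx=8192,
                       bytes_per_param=0.55, kv_bytes=2):
    """Estimate resident memory (GB) for quantized weights plus KV cache."""
    weights = params_b * 1e9 * bytes_per_param
    # KV cache: 2 tensors (K and V) per layer, ctx tokens, d_model wide.
    kv = 2 * n_layers * ctx * d_model * kv_bytes
    return (weights + kv) / 1e9

# Hypothetical configs loosely matching common model classes.
HOST_OVERHEAD_GB = 12  # assumed: Proxmox, containers, iGPU carve-out
for name, p, layers, dim in [("7B", 7, 32, 4096),
                             ("9B", 9, 42, 3584),
                             ("14B", 14, 48, 5120)]:
    need = model_footprint_gb(p, layers, dim)
    headroom = 32 - HOST_OVERHEAD_GB
    print(f"{name}: ~{need:.1f} GB needed vs {headroom} GB free")
```

Under these assumptions a 7B model wants roughly 8 GB resident and a 14B roughly 16 GB, which is why the sub-14B sweet spot holds: a 14B quant technically fits, but leaves little margin once anything else on the box grows.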
// TAGS
ollama · rocm · amd · ryzen · radeon 780m · local-llm · proxmox · homelab · open-webui · inference
DISCOVERED
6h ago
2026-04-18
PUBLISHED
8h ago
2026-04-18
RELEVANCE
7/10
AUTHOR
Pablo_Gates