BACK_TO_FEEDAICRIER_2
Qwen3.6-Plus throughput drops after idle
OPEN_SOURCE ↗
REDDIT · REDDIT// 7h agoBENCHMARK RESULT

Qwen3.6-Plus throughput drops after idle

A LocalLLaMA user reports 155-160 t/s on a 7900XTX at first boot, then a hard drop to 50 t/s after the machine sits idle for a while. The slowdown persists across context-size changes and only clears after a full PC reboot.

// ANALYSIS

Likely a GPU driver or power-state bug, not a model or context-window problem. The reboot-only recovery pattern is the strongest clue that the runtime or Radeon stack is getting stuck in a bad state.

  • The user says GPU temperature stays around 40C, which makes thermal throttling an unlikely explanation.
  • Context size changes from 32K down to 4K do not fix it, so this does not look like a simple memory-pressure issue.
  • Similar 7900XTX reports in the wild point to idle-power, clock-states, and driver regressions that only reset cleanly after reboot.
  • For local LLM users on AMD hardware, the next variables to isolate are driver version, backend/runtime changes, and any Windows power-management behavior.
// TAGS
qwen3.6-plusllmgpuinferencebenchmark

DISCOVERED

7h ago

2026-04-17

PUBLISHED

9h ago

2026-04-17

RELEVANCE

8/ 10

AUTHOR

soyalemujica