OPEN_SOURCE
REDDIT // 3h ago // TUTORIAL
DeepSeek-R1 Runs 30 Minutes Uncapped in LM Studio
A Reddit user reports that DeepSeek-R1-0528-Qwen3-8B-Q4_K_M running in LM Studio kept “thinking” for roughly 30 minutes before they manually stopped it, and asks how to cap reasoning at around 2 minutes. The post is really about controlling the runtime behavior of local reasoning models, not a product launch.
// ANALYSIS
Hot take: this is usually a configuration problem, not the model “going crazy.”
- Local reasoning models can keep producing chain-of-thought until they hit a token limit or an external stop condition, so an uncapped session can run far longer than expected.
- The most reliable mitigation is a hard cap on output tokens or a wall-clock timeout, set in LM Studio or in the serving layer.
- Prompting for “brief reasoning” can help, but it is not a hard guarantee; the model may still ramble if the stop conditions are loose.
- For something closer to a 2-minute ceiling, use a smaller context window, a lower max-token setting, and a stop rule in the client rather than relying on the prompt alone.
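The client-side stop rule described above can be sketched as a loop over a streamed response that bails out on either a token cap or a wall-clock deadline. This is a minimal illustration, not LM Studio's actual API: `fake_stream` stands in for a real streamed completion, and `MAX_TOKENS` / `TIME_BUDGET_S` are hypothetical values you would tune.

```python
import time

MAX_TOKENS = 2048      # hard cap on generated tokens (illustrative value)
TIME_BUDGET_S = 120.0  # ~2-minute wall-clock ceiling (illustrative value)

def fake_stream():
    """Stand-in for a streamed chat completion; yields one token at a time."""
    while True:
        yield "token "

def capped_generation(stream, max_tokens=MAX_TOKENS, time_budget=TIME_BUDGET_S):
    """Collect tokens until either the token cap or the time budget is hit."""
    deadline = time.monotonic() + time_budget
    out = []
    for i, tok in enumerate(stream):
        if i >= max_tokens or time.monotonic() > deadline:
            break  # stop condition reached: abandon the rest of the stream
        out.append(tok)
    return "".join(out), len(out)

text, n_tokens = capped_generation(fake_stream(), max_tokens=5, time_budget=1.0)
```

With a real server, the same idea applies: request a streamed response with a `max_tokens`-style parameter set, and enforce the deadline in the consuming loop so a runaway reasoning trace is cut off even if the server-side cap is generous.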
// TAGS
deepseek · lm studio · local llm · reasoning model · qwen · inference · quantization
DISCOVERED
3h ago
2026-04-16
PUBLISHED
21h ago
2026-04-16
RELEVANCE
6/10
AUTHOR
XEUIPR