OpenCode, Qwopus-Coder build RTS game
Developer Kyle Hessling demonstrated building a complete real-time strategy (RTS) game using the open-source coding agent OpenCode and the upcoming Qwopus-Coder-35B-A3B model. The local setup achieved 253 tokens per second on an RTX 5090, utilizing Multi-Token Prediction (MTP) with a 96.7% draft acceptance rate.
Local reasoning models are becoming fast enough to power high-quality agentic developer workflows. Combining a lightweight terminal interface with multi-token prediction allows local models to match the speed and performance of cloud-hosted commercial coding assistants.
- –Achieving 253 tokens per second on an RTX 5090 indicates massive speedups from multi-token prediction when draft acceptance is high.
- –The 96.7% draft acceptance rate demonstrates that structured coding tasks are highly predictable and ideal for speculative decoding.
- –Enabling a 2,048-token thinking budget allows the model to reason and plan complex projects, such as building a complete game from scratch.
- –OpenCode provides a lightweight, model-agnostic alternative to commercial IDE extensions, giving developers complete control over their hardware and LLM choice.
DISCOVERED
1h ago
2026-06-26
PUBLISHED
3h ago
2026-06-26
RELEVANCE
AUTHOR
KyleHessling1