GLM-5.1 hits 8-hour autonomous coding, tops SWE-Bench Pro
Zhipu AI has released GLM-5.1, a 754B parameter open-weights model capable of an 8-hour autonomous execution loop for full-stack engineering projects. The model topped the SWE-Bench Pro leaderboard with a score of 58.4, signaling a shift where open-source systems exceed proprietary capabilities in specialized tasks.
GLM-5.1 marks the transition from AI assistants to autonomous engineers capable of long-horizon project management. Its 8-hour autonomy window allows for thousands of tool calls and hundreds of iterations in a single session. Training on Huawei Ascend chips proves that state-of-the-art performance is achievable outside the Nvidia ecosystem. The combination of a 200,000-token context window and a 128,000-token output limit enables the generation and refactoring of massive codebases in one pass.
DISCOVERED
7h ago
2026-04-12
PUBLISHED
7h ago
2026-04-12
RELEVANCE
AUTHOR
AI Search