GLM-5.2 sets open-source ARC-AGI-2 record

// 1h agoBENCHMARK RESULT

GLM-5.2 sets open-source ARC-AGI-2 record

Z.ai's 744B open-weight model GLM-5.2 achieves a 22.8% score on the ARC-AGI-2 benchmark, marking the strongest performance to date for an open-source model. The model features agentic capabilities and a 1M-token context window designed for long-horizon software engineering tasks.

// ANALYSIS

While still trailing top closed-source models, GLM-5.2's ARC-AGI-2 performance signals serious fluid reasoning capabilities in the open-weight ecosystem.

–22.8% score demonstrates early agentic reasoning capacity previously restricted to proprietary APIs
–Under-the-hood architecture includes IndexShare and improved multi-token prediction to drastically reduce inference costs
–1M-token context window and MIT license position it as a viable local alternative for repository-scale coding agents
–Maintains the typical 6-12 month performance gap behind frontier models like GPT-5.5, which recently hit 85%

// TAGS

glm-5.2z.aibenchmarkreasoningopen-weightsllmagentopen-source

DISCOVERED

1h ago

2026-06-25

PUBLISHED

12h ago

2026-06-24

RELEVANCE

9/ 10

AUTHOR

fchollet

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

UPDATE3h ago

Cursor runs coding agents from CI

Cursor introduces remote, VM-backed background agents that can be triggered directly from CI pipelines and persist through local network disconnections. The agents run asynchronously in isolated cloud sandboxes, allowing developers to offload long-running tasks and receive completed pull requests hours later.

NEWS4h ago

Tesana user builds playable Backrooms game

A creator leveraged Tesana's prompt-to-world AI engine to build a playable Backrooms game following the release of the new Backrooms movie. The project demonstrates the platform's ability to rapidly generate topical 3D experiences without traditional game development.

NEWS6h ago

LuaJIT 3.0 proposes modern syntax extensions

Mike Pall has proposed a set of modern syntax extensions for LuaJIT 3.0, introducing features like nil-coalescing, optional chaining, and compound assignment. These features aim to improve developer quality-of-life and will be backported to LuaJIT 2.1 to ease compiler bootstrapping.