OpenRouter serves GLM 5.2 at 125 TPS

// 1h agoINFRASTRUCTURE

OpenRouter serves GLM 5.2 at 125 TPS

OpenRouter aggregates 22 providers for Z.ai's GLM 5.2 reasoning model, with throughput speeds exceeding 125 tokens per second. The setup provides developers with high-uptime, redundant API routing for long-context coding workflows.

// ANALYSIS

OpenRouter's multi-provider routing for GLM 5.2 highlights a growing trend of API aggregation to solve LLM uptime and latency bottlenecks.

–The 125+ TPS throughput makes GLM 5.2 highly viable for fast, interactive agentic coding workflows.
–Aggregating 22 providers ensures high uptime and mitigates single-point-of-failure risks for project-level automation.
–At $0.95/M input and $3/M output, GLM 5.2 on OpenRouter offers competitive pricing compared to direct Z.ai subscriptions.
–Support for `high` and `xhigh` reasoning effort levels allows developers to customize compute cost based on task complexity.

// TAGS

openrouterglm-5-2llminferenceapihosted-servicereasoning

DISCOVERED

1h ago

2026-06-23

PUBLISHED

1h ago

2026-06-23

RELEVANCE

8/ 10

AUTHOR

OpenRouter

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

OPEN SOURCE15m ago

tsbootstrap v0.3.0 drops with MCP, compiled acceleration

Open-source time series resampling library tsbootstrap has launched v0.3.0, introducing a read-only Model Context Protocol (MCP) server for AI coding agents and a compiled parallel-replicate VAR kernel for performance gains.

NEWS28m ago

Project Genie wins Cannes Lions Grand Prix

Google DeepMind's Project Genie has won the Grand Prix in Digital Craft at the 2026 Cannes Lions. The experimental world model was recognized for bridging frontier AI research with functional, interactive environments.

NEWS47m ago

Karpathy outlines new inline Claude paradigm

Andrej Karpathy highlighted a paradigm shift in interacting with Claude, emphasizing a terminal-based, file-centric workflow that aligns with how humans organize information. The approach leverages directory structures and persistent context files like CLAUDE.md to steer the AI agent more effectively.