OpenRouter serves GLM 5.2 at 125 TPS
OpenRouter aggregates 22 providers for Z.ai's GLM 5.2 reasoning model, with throughput speeds exceeding 125 tokens per second. The setup provides developers with high-uptime, redundant API routing for long-context coding workflows.
OpenRouter's multi-provider routing for GLM 5.2 highlights a growing trend of API aggregation to solve LLM uptime and latency bottlenecks.
- –The 125+ TPS throughput makes GLM 5.2 highly viable for fast, interactive agentic coding workflows.
- –Aggregating 22 providers ensures high uptime and mitigates single-point-of-failure risks for project-level automation.
- –At $0.95/M input and $3/M output, GLM 5.2 on OpenRouter offers competitive pricing compared to direct Z.ai subscriptions.
- –Support for `high` and `xhigh` reasoning effort levels allows developers to customize compute cost based on task complexity.
DISCOVERED
1h ago
2026-06-23
PUBLISHED
1h ago
2026-06-23
RELEVANCE
AUTHOR
OpenRouter