DeepInfra launches serverless GLM-5.2 API
DeepInfra has launched serverless API support for Z.ai's GLM-5.2, a 753-billion parameter flagship model with a 1-million-token context window and configurable reasoning modes. Hosting the MIT-licensed model on its infrastructure allows developers to integrate GLM-5.2 into agentic workflows without managing massive model weights.
Providing serverless access to massive flagship open-source models on day zero significantly accelerates the adoption of advanced coding agents.
* Offering API access to a 753B parameter model lowers the hardware barrier for developers building autonomous AI agents.
* Configurable thinking effort modes ("High" and "Max") paired with a 1M-token context window provide fine-grained control over API cost and latency trade-offs.
* Open-source models with MIT licensing hosted on serverless infrastructure present a direct challenge to closed-source frontier models in developer workflows.
DISCOVERED
1d ago
2026-06-16
PUBLISHED
1d ago
2026-06-16
RELEVANCE
AUTHOR
DeepInfra