Step 3.7 Flash launches on DeepInfra
DeepInfra has launched serverless API access for Step 3.7 Flash, a 198B-parameter sparse Mixture-of-Experts (MoE) vision-language model developed by StepFun. The model is specifically optimized for complex agentic workloads and features a 256K context window with selectable reasoning effort levels.
Specializing models for agentic workflows rather than general-purpose chat is the next step in AI utility, and DeepInfra hosting it makes production agent loops much cheaper.
* Step 3.7 Flash's selectable reasoning effort solves a major pain point for developers by offering a programmatic way to optimize API costs.
* Scoring 56.3 on SWE-bench Pro indicates that open-weights models are becoming highly competitive with proprietary models for agent tasks.
* Providing this on a serverless, pay-as-you-go API lowers the barrier to entry for building complex, long-running agent systems.
DISCOVERED
1h ago
2026-06-13
PUBLISHED
1h ago
2026-06-13
RELEVANCE
AUTHOR
DeepInfra