Step 3.7 Flash launches on DeepInfra

// 45d agoMODEL RELEASE

Step 3.7 Flash launches on DeepInfra

DeepInfra has launched serverless API access for Step 3.7 Flash, a 198B-parameter sparse Mixture-of-Experts (MoE) vision-language model developed by StepFun. The model is specifically optimized for complex agentic workloads and features a 256K context window with selectable reasoning effort levels.

// ANALYSIS

Specializing models for agentic workflows rather than general-purpose chat is the next step in AI utility, and DeepInfra hosting it makes production agent loops much cheaper.

* Step 3.7 Flash's selectable reasoning effort solves a major pain point for developers by offering a programmatic way to optimize API costs.

* Scoring 56.3 on SWE-bench Pro indicates that open-weights models are becoming highly competitive with proprietary models for agent tasks.

* Providing this on a serverless, pay-as-you-go API lowers the barrier to entry for building complex, long-running agent systems.

// TAGS

step-3.7-flashdeepinfrastepfunmoemultimodalai-agentsapi

DISCOVERED

45d ago

2026-06-13

PUBLISHED

45d ago

2026-06-13

RELEVANCE

8/ 10

AUTHOR

DeepInfra

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

MODEL4m ago

Anthropic red teams Fable 5.1 for August release

Anthropic has deployed Fable 5.1 into its red teaming portal for beta stress-testing ahead of an expected public launch. The new model aims to succeed Fable 5 with updated capabilities and performance enhancements, following recent pricing adjustments across Anthropic's model lineup.

MODEL4m ago

xAI schedules August release for Grok 4.6

xAI has officially scheduled the launch of its next-generation Grok 4.6 and Grok 4.7 models for August, with Grok 4.6 targeted to release in two weeks followed by Grok 4.7 two weeks later. According to Elon Musk, the upcoming models will scale up to a reported 10 trillion parameters to advance performance in complex reasoning.

MODEL4m ago

Gemini 4 pre-training checkpoints hit LMSYS Arena

Initial pre-training checkpoints for Google's Gemini 4 model family have surfaced on LMSYS Arena for blind benchmarking. Early demonstrations highlight substantial rendering improvements for complex 3D WebGL simulations compared to Gemini 3.6 Flash.