MiniMax launches ultra-fast M3 model

// 45d agoMODEL RELEASE

MiniMax launches ultra-fast M3 model

MiniMax has announced MiniMax M3, a brand new model architecture featuring a 1-million-token context window, native video input support, and up to 15.6x faster decoding speeds. The model is priced disruptively at $0.30 per million input tokens and $1.20 per million output tokens, positioning it as a highly competitive and efficient multimodal option.

// ANALYSIS

This is a massive shot across the bow for frontier LLM providers, proving that the race for long-context models is rapidly shifting from capability to pure, optimized inference speed at dirt-cheap prices. If these performance and speed claims hold up under real-world workloads, it will make long-context agentic workflows and real-time video analysis incredibly practical and affordable.

–**Incredible Price-to-Performance Ratio**: At $0.30/1M input and $1.20/1M output, MiniMax M3 is aggressively priced, undercutting many existing long-context offerings.
–**Architectural Breakthrough**: The claimed 15.6x faster decoding speed at a full 1M token context suggests an incredibly efficient implementation of sparse attention that solves key latency bottlenecks.
–**Native Multimodality**: Native support for video inputs alongside large text contexts opens up powerful new opportunities for real-world video processing, summarization, and interactive agents.
–**Pressure on Competitors**: A massive speed and cost disruption like this will force other model providers to prioritize inference optimization and pricing drops.

// TAGS

minimax-m3llmlong-contextmultimodalvideo-aisparse-attentionmodel-release

DISCOVERED

45d ago

2026-06-01

PUBLISHED

45d ago

2026-06-01

RELEVANCE

8/ 10

AUTHOR

bridgemindai

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

RESEARCH2h ago

GPT-5.6 Sol Pro disproves Benjamini-Hochberg conjecture

University of Pennsylvania professor Edgar Dobriban utilized OpenAI's GPT-5.6 Sol Pro to disprove a 30-year-old conjecture about the Benjamini-Hochberg procedure under correlated tests. Running in Pro mode, the reasoning model generated a mathematical proof and numerical certificate verifying the failure in 90 minutes.

OPEN SOURCE3h ago

Prismor launches AI agent runtime firewall

Prismor is an open-source runtime firewall and security control plane that intercepts and validates AI agent tool calls in real time. Sitting at the tool-call boundary, it enforces cryptographically signed policies and maintains detailed audit trails to prevent prompt injections, secret leaks, and unauthorized commands.

MODEL4h ago

DeepSeek V4, Kimi K3 dropping soon

The upcoming releases of DeepSeek V4 GA and Moonshot AI's Kimi K3 represent a highly anticipated next step for the Chinese AI ecosystem, with early builds of the models showing highly impressive capabilities that could replicate the impact of the DeepSeek-R1 release.