Mistral drops Small 4, a 119B MoE model
Mistral AI launches Mistral Small 4 (119B-2603), a unified Mixture-of-Experts model integrating instruction, reasoning, and coding capabilities. The Apache 2.0 release features a 256k context window and a configurable "Reasoning Effort" mode for deep problem-solving.
Mistral Small 4 is a major architectural pivot that brings 100B+ tier reasoning to the Small family via high-density MoE. It uses 128 experts with 4 active per token, keeping activated parameters at just 6.5B and maintaining high inference speed despite the 119B total size. The release unifies the Instruct, Reasoning (Magistral), and Coding (Devstral) models into a single all-rounder optimized for local deployment. A new reasoning_effort parameter lets users trade inference time for depth, signaling a shift toward test-time compute for local models. Apache 2.0 licensing undercuts proprietary competitors like GPT-4o mini and Claude 3.5 Haiku by offering a royalty-free alternative with 100B-class performance on consumer-accessible hardware such as high-memory Macs.
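The sparsity arithmetic behind the speed claim can be sanity-checked from the announced figures alone. A minimal sketch, using only the numbers above (4-of-128 expert routing, 6.5B active of 119B total); any finer routing details are not stated in the release:

```python
# Figures from the announcement; everything else here is derived arithmetic.
total_experts = 128
active_experts = 4
total_params_b = 119.0   # total parameters, in billions
active_params_b = 6.5    # parameters activated per token, in billions

# Share of experts consulted for each token
expert_fraction = active_experts / total_experts

# Share of the full weight set actually computed per token
active_fraction = active_params_b / total_params_b

print(f"{expert_fraction:.1%} of experts active per token")    # 3.1%
print(f"{active_fraction:.1%} of total parameters activated")  # 5.5%
```

The activated-parameter share (5.5%) exceeding the expert share (3.1%) is typical for MoE designs, since dense components like attention layers and embeddings run for every token regardless of routing.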
DISCOVERED: 2026-03-16
PUBLISHED: 2026-03-16
AUTHOR: seamonn