OPEN_SOURCE
REDDIT // MODEL RELEASE
Devstral-Small-2-24B lands Opus reasoning tune
Adam Jenner fine-tuned Devstral-Small-2-24B on roughly 2.3k Claude 4.6 Opus reasoning traces, then shipped Q4_K_M and Q5_K_M GGUFs plus a LoRA adapter. The goal is to make the local coding model reason explicitly before it writes code, with Q5 recommended for best quality.
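A fine-tune like this comes down to how each reasoning trace is packaged into a supervised sample. The exact template is not documented in the release; the sketch below assumes a common pattern for reasoning tunes, wrapping the trace in explicit markers ahead of the final code (the `<reasoning>` tag and `build_sample` helper are illustrative, not from the release).

```python
# Sketch: packaging a reasoning trace into a fine-tuning sample so the
# model learns to reason explicitly before emitting code. Template markers
# are an assumption; the release does not publish its data format.

def build_sample(instruction: str, trace: str, code: str) -> dict:
    """Return a prompt/completion pair where the completion reasons first."""
    completion = (
        "<reasoning>\n" + trace.strip() + "\n</reasoning>\n\n"
        "```python\n" + code.strip() + "\n```"
    )
    return {"prompt": instruction.strip(), "completion": completion}

sample = build_sample(
    "Write a function that reverses a string.",
    "Slicing with a step of -1 reverses a Python sequence in one pass.",
    "def reverse(s: str) -> str:\n    return s[::-1]",
)
```

Training on ~2.3k such pairs mostly teaches the reason-then-code habit rather than new knowledge, which matches the release's framing.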
// ANALYSIS
This is less a SOTA chase than a behavior-packaging exercise: the value is in getting more deliberate coding out of a 24B model people can actually run locally.
- The ~2.3k-sample dataset is small, so expect a planning/style lift rather than a dramatic jump in raw capability.
- The VLM-to-text-only extraction is the real feat; it makes a multimodal base trainable on a single RTX 3090.
- GGUFs plus a LoRA adapter are the right packaging for this audience: easy local testing, easy merges, easy Q4 vs Q5 comparisons.
- The epoch-2 checkpoint choice, 2,048-token cap, and no-benchmark caveat suggest the model is tuned for practical generalization, not leaderboard theater.
- –Apache 2.0 licensing and open weights make it a useful sandbox for local coding agents and reasoning distills.
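For the Q4 vs Q5 decision, a back-of-the-envelope footprint estimate is useful. The bits-per-weight figures below are rough community numbers for llama.cpp k-quants, not measurements of these specific files, and the parameter count is taken from the model name:

```python
# Rough disk/VRAM footprint for the two published quants of a 24B model.
# Bits-per-weight values are approximate k-quant averages (assumptions),
# and the estimate excludes KV cache and runtime overhead.

PARAMS = 24e9  # ~24B parameters, per the model name

BITS_PER_WEIGHT = {
    "Q4_K_M": 4.85,  # approximate
    "Q5_K_M": 5.69,  # approximate
}

def est_gb(quant: str) -> float:
    """Estimated weight size in GiB for the given quant type."""
    return PARAMS * BITS_PER_WEIGHT[quant] / 8 / 2**30

for q in BITS_PER_WEIGHT:
    print(f"{q}: ~{est_gb(q):.1f} GiB")
```

By this estimate both quants leave headroom on a 24 GB card, which is why the Q5 recommendation is a reasonable default rather than a stretch.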
// TAGS
devstral-small-2-24b-opus-reasoning · reasoning · fine-tuning · ai-coding · open-weights · llm
DISCOVERED
2026-03-24
PUBLISHED
2026-03-24
RELEVANCE
8/10
AUTHOR
admajic