DeepReinforce open-sources Ornith-1.0 coding models

// 1d agoMODEL RELEASE

DeepReinforce open-sources Ornith-1.0 coding models

DeepReinforce AI has released Ornith-1.0, an MIT-licensed family of open-source coding models built on Gemma 4 and Qwen 3.5 architectures. The models utilize a self-improving reinforcement learning strategy to optimize code generation and scaffold execution trajectories.

// ANALYSIS

DeepReinforce's RL-driven scaffolding training approach demonstrates how open-source models can achieve SOTA agentic performance without massive parameter scaling. This family could significantly lower the cost of running local code-editing agents.

–The self-improving RL scaffold generation allows the models to optimize search trajectories, leading to superior bug localization and multi-file refactoring.
–With models ranging from 9B dense to a massive 397B MoE, developers can run capable agentic models locally on edge devices or host them on high-throughput infrastructure.
–Achieving 82.4 on SWE-bench verified places the 397B MoE variant among top-tier open-source reasoning models.
–Native compatibility with vLLM, SGLang, and standard OpenAI-compatible APIs makes drop-in replacement in current agent architectures trivial.
–Released under an MIT license, it offers a fully permissive alternative for enterprise environments requiring strict licensing compliance.

// TAGS

ornith-1-0llmopen-weightsreasoningai-codingcoding-agentagent

DISCOVERED

1d ago

2026-06-25

PUBLISHED

1d ago

2026-06-25

RELEVANCE

9/ 10

AUTHOR

WorldofAI

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

UPDATE54m ago

Mintlify assistant routes users directly to pages

Mintlify has updated its AI documentation assistant to automatically redirect users to the exact page they are looking for based on their query intent. This feature speeds up documentation navigation by bypassing chat responses when the user's destination is clear.

POLICY54m ago

US lifts Claude Mythos 5 ban

The U.S. government has lifted the ban on Anthropic's Claude Mythos 5 model, allowing distribution to over 100 American institutions. The cybersecurity-focused model had been taken offline globally due to initial security and jailbreak concerns.

TUTORIAL2h ago

Git worktrees unlock Claude Code parallelism

Anthropic's Claude Code CLI uses native git worktrees to run multiple independent agent sessions in parallel. This prevents file collisions and allows developers to multitask across different branches without interrupting active agent runs.