Ornith-1.0 drops with self-improving scaffolding
DeepReinforce has released Ornith-1.0, an open-source family of agentic coding models ranging from 9B to 397B parameters. The models train themselves to generate both code solutions and task-specific scaffold orchestration.
Ornith-1.0 is a major breakthrough for open-source agents, proving that reinforcement learning can optimize both task execution and execution scaffolds.
- –Direct training on scaffold generation reduces developer effort in manually designing complex execution environments
- –Built-in reward-hacking mitigation solves a critical issue where RL agents write passing tests instead of correct code
- –Broad range of sizes, from 9B dense to a 397B MoE flagship, enables deployments from local edge dev environments to massive enterprise clusters
- –Outperforming peers on SWE-Bench Verified (82.4) demonstrates that self-generated scaffolding matches or exceeds hand-crafted developer loops
DISCOVERED
1h ago
2026-06-25
PUBLISHED
1h ago
2026-06-25
RELEVANCE
AUTHOR
DAIEvolutionHub