OPEN_SOURCE ↗
HN · HACKER_NEWS// 4h agoINFRASTRUCTURE
Mendral cuts costs after Opus upgrade
Mendral says its CI-triage system now runs cheaper on Opus 4.6 than it did on Sonnet 4.0 because a narrow Haiku triager filters most failures before they reach the expensive model. The post argues that the win comes from model routing, scoped sub-agents, and letting agents pull just the context they need.
// ANALYSIS
The real lesson is that frontier models can reduce spend when they become the planner, not the workhorse. Better orchestration beats brute-force prompting.
- –Mendral reports that 4 out of 5 failures never reach Opus, so most volume is handled by a cheap classifier instead of the premium model
- –The system uses Opus for hypothesis generation and Haiku for log reading and targeted retrieval, which keeps the expensive context clean
- –Pushing logs into prompts is replaced by SQL against ClickHouse, which is a more scalable pattern for high-volume debugging data
- –Capping sub-agent fan-out matters as much as model choice, because runaway delegation can erase any token savings
- –The architecture is most relevant for teams handling noisy event streams like CI, security alerts, or telemetry, not just software build failures
// TAGS
mendralllmagentinferencepricingautomation
DISCOVERED
4h ago
2026-04-29
PUBLISHED
8h ago
2026-04-29
RELEVANCE
8/ 10
AUTHOR
shad42