OpenRouter adds real-time cache hit rates to pricing
OpenRouter has added real-time cache hit rates and historical traffic metrics to its pricing tab, so developers can see observed provider performance for models such as Claude Opus 4.8. With caching metrics shown alongside list prices, users can see why costs vary between providers and choose a provider based on effective cost rather than the nominal list rate.
Showing real-time cache hit rates exposes the dynamic cost of LLMs and shifts developer focus from list pricing to effective pricing, meaning the price paid once cached tokens are billed at their discounted rate. Developers can now base model-routing decisions on observed caching performance. Historical traffic data adds another layer of transparency that can help in judging provider reliability and performance. The change may also pressure other model aggregators and direct API providers to offer similar observability metrics.
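To make the difference between list and effective pricing concrete, here is a minimal sketch of the arithmetic in Python. It assumes that effective input cost is a simple blend of the uncached and cached rates, weighted by the cache hit rate, and it ignores any cache-write surcharge and output-token cost. The provider names, prices, and hit rates are hypothetical illustrations, not figures from OpenRouter, and the class and function names are invented for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProviderQuote:
    """Hypothetical per-provider figures; none of these come from OpenRouter."""
    name: str
    input_price: float         # USD per million uncached input tokens (list rate)
    cached_input_price: float  # USD per million input tokens served from cache
    cache_hit_rate: float      # fraction of input tokens served from cache, 0.0 to 1.0


def effective_input_price(q: ProviderQuote) -> float:
    """Blend the list and cached rates, weighted by the cache hit rate."""
    if not 0.0 <= q.cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    uncached_share = 1.0 - q.cache_hit_rate
    return uncached_share * q.input_price + q.cache_hit_rate * q.cached_input_price


def cheapest(quotes: list[ProviderQuote]) -> ProviderQuote:
    """Pick the provider with the lowest effective input price."""
    return min(quotes, key=effective_input_price)


# Made-up numbers: provider_b has the higher list price but the better hit rate.
quotes = [
    ProviderQuote("provider_a", input_price=10.0, cached_input_price=1.0, cache_hit_rate=0.2),
    ProviderQuote("provider_b", input_price=12.0, cached_input_price=1.0, cache_hit_rate=0.8),
]

if __name__ == "__main__":
    for q in quotes:
        print(f"{q.name}: list {q.input_price:.2f}, effective {effective_input_price(q):.2f}")
    print("cheapest by effective price:", cheapest(quotes).name)
```

With these invented inputs, the provider with the higher list price comes out cheaper once its hit rate is factored in, which is the kind of comparison the new metrics are meant to support.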
DISCOVERED
2026-06-07
PUBLISHED
2026-06-07
AUTHOR
OpenRouter