OpenRouter launches Microsoft MAI models
OpenRouter has added support for three new Microsoft AI models: MAI-Image-2.5, MAI-Transcribe-1.5, and MAI-Voice-2. These models offer specialized, high-performance capabilities for image generation, speech-to-text transcription, and text-to-speech voice synthesis.
Microsoft is positioning these specialized, budget-friendly models as a multimodal alternative to OpenAI's ecosystem, covering image generation, text-to-speech, and speech-to-text.
- MAI-Image-2.5 challenges mainstream image models with market-leading quality-per-dollar and top-tier leaderboard performance.
- MAI-Transcribe-1.5 offers ultra-fast transcription (1 hour of audio in under 15 seconds) paired with SOTA accuracy, directly competing with Whisper.
- MAI-Voice-2 provides emotional expression control and long-form consistency, which are critical for media production and voice agents.
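To put the transcription claim in perspective, the quoted figure (1 hour of audio in under 15 seconds) works out to at least 240 times faster than real time. The sketch below only does that arithmetic; the function name `realtime_factor` is illustrative, and the 15-second value is the claimed upper bound, not a measurement.

```python
def realtime_factor(audio_seconds: float, processing_seconds: float) -> float:
    """Return seconds of audio processed per second of wall-clock time."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


# Figures taken from the claim above: 1 hour of audio, 15-second upper bound.
factor = realtime_factor(60 * 60, 15)
print(f"at least {factor:.0f}x real time")  # prints "at least 240x real time"
```

Because 15 seconds is an upper bound on processing time, 240x is a lower bound on the speed-up implied by the claim.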
Discovered: 2026-06-02
Published: 2026-06-02
Author: OpenRouter