Medical STT benchmark v4 reshuffles rankings
OPEN_SOURCE
REDDIT · 4d ago · BENCHMARK RESULT


Omi-Health's updated benchmark evaluates 43 speech-to-text models using a new Medical-WER metric that prioritizes clinically relevant terms. Gemini 3 Pro Preview tops the board, while Microsoft's open-source VibeVoice-ASR 9B outperforms MAI-Transcribe-1.

// ANALYSIS

Standard Word Error Rate (WER) is a dangerous metric for medical AI because it weights a dropped filler word the same as a garbled, life-critical drug name.

  • Medical-WER (M-WER) reveals that top general models often butcher drug names, with error rates 2-5x higher than other categories.
  • Microsoft's open-source VibeVoice-ASR 9B (#3) beats the company's own closed MAI-Transcribe-1 (#11) by 1.7 points, highlighting the strength of LLM-backed transcription.
  • Qwen3-ASR 1.7B emerges as the best small open-source model, delivering near-Gemini performance at 14x the speed of larger models.
  • Deepgram Nova-3 Medical holds its ground as the fastest cloud API, completing files in just 13 seconds without compromising accuracy.
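The post doesn't spell out how Medical-WER is computed, but the core idea — a WER variant in which errors on clinically relevant terms cost more than errors on ordinary words — can be sketched as a weighted Levenshtein alignment. The function name, weight value, and term list below are illustrative assumptions, not the benchmark's actual implementation:

```python
def weighted_wer(reference, hypothesis, critical_terms, critical_weight=5.0):
    """Weighted WER: Levenshtein alignment where errors on critical
    terms (e.g. drug names) carry extra cost. Illustrative sketch only."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()

    def cost(token):
        # Errors on critical terms are penalized more heavily.
        return critical_weight if token in critical_terms else 1.0

    # DP table: d[i][j] = min weighted edit cost aligning ref[:i] to hyp[:j]
    d = [[0.0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(1, len(ref) + 1):
        d[i][0] = d[i - 1][0] + cost(ref[i - 1])        # deletion
    for j in range(1, len(hyp) + 1):
        d[0][j] = d[0][j - 1] + cost(hyp[j - 1])        # insertion
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = 0.0 if ref[i - 1] == hyp[j - 1] else max(cost(ref[i - 1]), cost(hyp[j - 1]))
            d[i][j] = min(
                d[i - 1][j] + cost(ref[i - 1]),         # delete ref word
                d[i][j - 1] + cost(hyp[j - 1]),         # insert hyp word
                d[i - 1][j - 1] + sub,                  # substitute / match
            )

    # Normalize by the total weight of the reference, not its raw length.
    total_weight = sum(cost(t) for t in ref) or 1.0
    return d[len(ref)][len(hyp)] / total_weight
```

Under this scheme, swapping "metformin" for "metronidazole" in a four-word sentence scores 0.625 (5/8) rather than the plain-WER 0.25 — exactly the kind of gap that reshuffles rankings when drug-name errors run 2-5x higher than other categories.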
// TAGS
medical-stt-benchmark, benchmark, stt, speech, medical, llm, open-source, gemini, qwen3, vibevoice

DISCOVERED

2026-04-08

PUBLISHED

2026-04-08

RELEVANCE

8/10

AUTHOR

MajesticAd2862