GPT-Realtime-Whisper brings streaming speech to text
OpenAI’s GPT-Realtime-Whisper is a low-latency transcription model that turns audio into text as people speak. It’s aimed at live captions, meeting notes, and other workflows where the transcript needs to keep pace with the speaker.
This is the unglamorous part of voice AI that actually matters: if transcription lags, the whole experience feels broken. GPT-Realtime-Whisper makes the Realtime stack more useful for production workflows by shrinking the delay between speech and text.
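To make the "partial results first" workflow concrete, here is a minimal sketch of how a client might fold a streaming transcript into UI state. The event shapes (`transcript.delta` for partials, `transcript.completed` for finalized utterances) are assumptions for illustration; the actual Realtime API event names may differ.

```python
# Hypothetical event shapes, assumed for illustration:
#   {"type": "transcript.delta", "text": "..."}      -> partial result, may change
#   {"type": "transcript.completed", "text": "..."}  -> final text of an utterance
# Real Realtime API event names and payloads may differ.

def fold_transcript(events):
    """Accumulate streaming transcript events into (live_caption, finalized) state."""
    finalized = []   # completed utterances, safe to persist
    partial = ""     # in-flight caption for the current utterance
    for ev in events:
        if ev["type"] == "transcript.delta":
            partial += ev["text"]          # render immediately: latency beats polish
        elif ev["type"] == "transcript.completed":
            finalized.append(ev["text"])   # final text supersedes the partial
            partial = ""
    return partial, finalized
```

In a live-caption UI, `partial` is redrawn on every delta while `finalized` lines scroll up behind it, which is what makes the transcript feel like it keeps pace with the speaker.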
- Live STT is the substrate for captions, note-taking, support triage, and voice agents that need continuous understanding
- Streaming transcripts unlock partial results earlier, which matters more than perfect end-state text in real-time products
- OpenAI is pricing it at $0.017/min, which signals this is meant for high-volume operational use, not demos
- The release reinforces the idea that voice stacks are becoming modular: reasoning, translation, and transcription are now separate building blocks
- For developers, the main win is UX: less waiting, fewer “please hold” moments, and more natural conversation flow
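A back-of-envelope cost model makes the "operational use, not demos" point concrete. This sketch assumes billing is purely per audio minute at the announced rate, with no per-request overhead:

```python
PRICE_PER_MIN = 0.017  # $/minute of audio, from the announcement

def monthly_cost(hours_per_day, days=30):
    """Estimated monthly transcription spend for a continuous audio workload."""
    minutes = hours_per_day * 60 * days
    return minutes * PRICE_PER_MIN

# e.g. a support line transcribing 8 hours of calls a day:
# 8 * 60 * 30 = 14,400 minutes, roughly $245/month
```

At these rates, always-on transcription for a single channel costs hundreds of dollars a month rather than thousands, which is the scale at which captioning and meeting-notes products become viable.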
Discovered: 2026-05-07 · Published: 2026-05-07
Author: OpenAI