vLLM adds Cohere-Transcribe for efficient ASR

// 108d agoOPENSOURCE RELEASE

vLLM adds Cohere-Transcribe for efficient ASR

vLLM has integrated Cohere's new cohere-transcribe-03-2026 model, providing native support for high-throughput speech-to-text. By leveraging variable-length encoder inputs, the integration eliminates traditional padding overhead to maximize inference efficiency.

// ANALYSIS

Cohere's move into ASR via vLLM directly challenges Whisper's dominance because the integration is built around variable-length encoder inputs instead of fixed-padding models. Adding it to the v1/audio/transcriptions API gives developers a unified stack for serving both LLMs and state-of-the-art ASR from a single engine, and native CohereAsrForConditionalGeneration support makes it a credible open-weights alternative to proprietary transcription APIs. The standardized English text normalizers in vLLM's test suite help make the integration feel production-ready for enterprise deployments.

// TAGS

vllmcoherespeechopen-sourceinferenceapiaudio-gen

DISCOVERED

108d ago

2026-03-26

PUBLISHED

108d ago

2026-03-26

RELEVANCE

8/ 10

AUTHOR

LinkSea8324

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

INFRA24m ago

NaN Builders hosts parallel OpenCode agents

NaN Builders is a flat-rate GPU inference platform offering developers persistent, isolated microVM environments. A developer demonstrated the platform by running three parallel OpenCode coding agents using self-hosted models hosted directly on NaN Builders, avoiding token-metered fees.

INFRA49m ago

Prime Intellect launches verifiers v1 for agentic RL

Prime Intellect has released verifiers v1, an overhauled environment stack for agentic RL that decomposes environments into composable tasksets, harnesses, and runtimes. The update introduces a managed interception server that records traces as message DAGs, enabling O(n) scaling to make long-horizon training and router replay feasible.

OPEN SOURCE3h ago

git/star-history-chart embeds star charts in READMEs

git/star-history-chart is a skill for the Claude Code Templates CLI that generates a repository's star history chart as an SVG and embeds it in the README. The system uses the repository's native GITHUB_TOKEN to fetch stargazer data via a GitHub Actions workflow and commits the output directly, eliminating the need for third-party services or external secret configurations.