YOU ARE VIEWING ONE ITEM FROM THE AICRIER FEED

erm is a command-line tool that transcribes English speech with Whisper and automatically removes filler words using FFmpeg crossfades.

AICrier tracks AI developer news across Product Hunt, GitHub, Hacker News, YouTube, X, arXiv, and more. This page keeps the article you opened front and center while giving you a path into the live feed.

// WHAT AICRIER DOES

7+

TRACKED FEEDS

24/7

SCRAPED FEED

Short summaries, external links, screenshots, relevance scoring, tags, and featured picks for AI builders.

erm is a command-line tool that transcribes English speech with Whisper and automatically removes filler words using FFmpeg crossfades.
OPEN LINK ↗
// 6d agoOPENSOURCE RELEASE

erm is a command-line tool that transcribes English speech with Whisper and automatically removes filler words using FFmpeg crossfades.

erm is an open-source command-line tool that transcribes audio using Whisper and splits out disfluencies like "um" and "uh" with clean FFmpeg crossfading. It uses faster-whisper for speech-to-text and performs multiple detection passes, including gap analysis, duration-based spotting, and embedded filler detection to locate filler words. To ensure the edits sound natural and seamless, the tool aligns cuts to zero-crossings, applies adaptive crossfades, and matches room tone to prevent audible clicks or abrupt shifts in background noise.

// ANALYSIS

While cloud-based editors like Descript have popularized automated filler word removal, erm offers a free, local-first CLI alternative for users who want to script their audio workflows or keep their files private.

* Local-first execution: Runs transcription and audio editing on the user's machine without external APIs.

* High-quality audio editing: Employs zero-crossing cuts, room tone matching, and adaptive crossfades to ensure smooth transitions between edits.

* Whisper-powered accuracy: Utilizes faster-whisper to detect filler words, combined with gap and duration analysis.

* Easy developer integration: Can be easily run as a CLI tool or integrated into automated media processing pipelines.

// TAGS
audiowhispercliffmpegspeech-processingopen-sourcepython

DISCOVERED

6d ago

2026-06-13

PUBLISHED

6d ago

2026-06-13

RELEVANCE

7/ 10

AUTHOR

Github Awesome