NoobScribe ships local transcription, diarization, speaker memory
NoobScribe is a self-hosted, Whisper-compatible FastAPI transcription stack built on parakeet-diarized and packaged for Docker, with a web UI for recording or uploading audio and managing recordings, transcripts, and speakers. It uses Canary 1B v2 for speech-to-text, Pyannote for diarization, and Chroma-backed speaker memory so known voices keep their names across future and retroactive transcripts.
The real product is not speech-to-text; it is transcript memory. By persisting speaker embeddings, NoobScribe makes diarization compound across meetings instead of resetting every time.
- –Whisper compatibility lowers adoption friction because existing tools can point at `/v1/audio/transcriptions` without rewriting clients.
- –Chroma-backed embeddings are the moat: new speakers can be matched later and old transcripts can be rewritten with better names.
- –The setup tradeoff is real, with a Hugging Face token, Docker, and possible GPU dependence, but that buys local control and privacy.
- –This shines most in meetings, interviews, and recurring calls where speaker attribution matters almost as much as raw transcript accuracy.
DISCOVERED
64d ago
2026-03-24
PUBLISHED
64d ago
2026-03-24
RELEVANCE
AUTHOR
meganoob1337