The open-source AI voice studio. Clone, dictate, create.
Natural (2-way) voice conversations with Claude Code
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Faster Whisper transcription with CTranslate2
A nearly-live implementation of OpenAI’s Whisper.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Voice-to-text with push-to-talk for Wayland compositors