WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Faster Whisper transcription with CTranslate2
OpenVINOâ„¢ is an open source toolkit for optimizing and deploying AI inference
Whisper.net. Speech to text made simple using Whisper Models
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Voice-to-text with push-to-talk for Wayland compositors
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
OpenAI Whisper ASR Webservice API
A speech to text IBus engine using VOSK