OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Faster Whisper transcription with CTranslate2
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A fundamental end-to-end speech recognition toolkit and open-source SOTA pretrained models, supporting speech recognition, voice activity detection, text post-processing, etc.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal domains, for both inference and training.
Voice-to-text with push-to-talk for Wayland compositors
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
OpenAI Whisper ASR Webservice API
Whisper.net: speech to text made simple using Whisper models
A speech to text IBus engine using VOSK