🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
OpenVINOâ„¢ is an open source toolkit for optimizing and deploying AI inference
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Port of OpenAI’s Whisper model in C/C++
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
OpenAI Whisper ASR Webservice API
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Faster Whisper transcription with CTranslate2
Whisper.net. Speech to text made simple using Whisper Models
A speech to text IBus engine using VOSK