SoftVC VITS Singing Voice Conversion
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Build local voice agents with open-source models
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
OpenAI Whisper ASR Webservice API