Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
Generate audiobooks from e-books, voice cloning & 1158+ languages!
A generative speech model for daily dialogue.