Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
YouTube, Apple Podcasts (and more) to readable Markdown.
Self-hosted AI audio transcription
Local Audio Transcription Tool