WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)