GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak . . . Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.
Introducing Whisper - OpenAI Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise and technical language.