Whisper is an open-source family of automatic speech recognition models developed by OpenAI. It uses a Transformer encoder-decoder architecture for multilingual transcription, speech translation into English, and language identification. Trained on over 680,000 hours of audio spanning 98 languages, the models handle both short-form and long-form audio, processing longer recordings in chunks, and are available in multiple size variants under the MIT License.
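As a rough illustration of the chunking idea, the sketch below splits a long recording into fixed 30-second windows and zero-pads the last one. It assumes 16 kHz mono samples and a 30-second window, which match Whisper's documented input format. The helper name `chunk_audio` is hypothetical, and the fixed-stride split is a simplification for illustration, not the windowing logic of any particular Whisper implementation.

```python
from typing import List, Sequence

SAMPLE_RATE = 16_000   # Hz; assumed input rate (Whisper expects 16 kHz mono audio)
CHUNK_SECONDS = 30     # assumed window length (Whisper's encoder takes 30-second inputs)


def chunk_audio(
    samples: Sequence[float],
    sample_rate: int = SAMPLE_RATE,
    chunk_seconds: int = CHUNK_SECONDS,
) -> List[List[float]]:
    """Split samples into fixed-length windows, zero-padding the final one.

    Hypothetical helper: a plain fixed-stride split, with no overlap and no
    timestamp-based alignment between windows.
    """
    chunk_len = sample_rate * chunk_seconds
    chunks: List[List[float]] = []
    for start in range(0, len(samples), chunk_len):
        chunk = list(samples[start:start + chunk_len])
        # Pad the last window with silence so every chunk has the same length.
        chunk.extend([0.0] * (chunk_len - len(chunk)))
        chunks.append(chunk)
    return chunks


# Usage: 70 seconds of silence becomes three 30-second windows.
audio = [0.0] * (70 * SAMPLE_RATE)
windows = chunk_audio(audio)
print(len(windows), len(windows[0]))  # 3 480000
```

Each window would then be transcribed separately and the text joined. A fixed split like this can cut words at window boundaries, which is why practical pipelines typically overlap windows or align them on predicted timestamps.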