Whisper is a family of advanced speech recognition models developed by OpenAI, first released in September 2022 and continuing to evolve through 2023. Distinguished by their robust performance across multiple languages and acoustic conditions, these models represent a significant advancement in automatic speech recognition (ASR) technology. The family employs a Transformer-based encoder-decoder architecture and was trained on an extensive dataset of 680,000 hours of multilingual and multitask supervised data, as detailed in the original research paper.
The Whisper family utilizes a unified sequence-to-sequence architecture that processes audio in 30-second segments. Each segment is converted into log-Mel spectrograms before being processed by the encoder. The decoder then generates text predictions, incorporating special tokens for various tasks including language identification, timestamping, and translation capabilities. This architectural approach effectively consolidates multiple traditional speech processing pipeline stages into a single, comprehensive model.
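The front-end described above can be sketched in plain NumPy. This is a simplified illustration of the 30-second segmentation and log-Mel conversion, not OpenAI's exact implementation; the constants (80 mel bins, a 400-sample / 25 ms window, a 160-sample / 10 ms hop at 16 kHz) follow the published Whisper setup:

```python
import numpy as np

SR = 16_000          # sampling rate
N_FFT = 400          # 25 ms analysis window
HOP = 160            # 10 ms hop
N_MELS = 80          # mel filter count
CHUNK = 30 * SR      # one 30-second segment = 480,000 samples

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_mels=N_MELS, n_fft=N_FFT, sr=SR):
    # Triangular filters spaced evenly on the mel scale.
    mels = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mels) / sr).astype(int)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        lo, ctr, hi = bins[i], bins[i + 1], bins[i + 2]
        for j in range(lo, ctr):
            fb[i, j] = (j - lo) / (ctr - lo)
        for j in range(ctr, hi):
            fb[i, j] = (hi - j) / (hi - ctr)
    return fb

def log_mel_spectrogram(audio):
    # Pad or trim to exactly one 30-second segment, as Whisper does.
    audio = np.pad(audio[:CHUNK], (0, max(0, CHUNK - len(audio))))
    window = np.hanning(N_FFT)
    frames = np.stack([audio[i:i + N_FFT] * window
                       for i in range(0, CHUNK - N_FFT, HOP)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2   # (n_frames, 201)
    mel = mel_filterbank() @ power.T                   # (80, n_frames)
    return np.log10(np.maximum(mel, 1e-10))

spec = log_mel_spectrogram(np.random.randn(5 * SR))    # 5 s of noise
print(spec.shape)  # (80, 2998): one row per mel filter, one column per 10 ms hop
```

The encoder then consumes this spectrogram, and the decoder autoregressively emits text tokens conditioned on it.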
The technical implementation relies on several key components, notably the WhisperProcessor, which combines the WhisperFeatureExtractor (responsible for computing log-mel filter-bank features from raw audio) with the model's tokenizer. The models support several optimization techniques, including PyTorch Scaled Dot Product Attention (SDPA), Flash Attention 2, and torch.compile, which collectively improve inference speed and processing efficiency, as documented in the Hugging Face Model Documentation.
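As a brief sketch of that pipeline (assuming the `transformers` library is installed), the feature extractor's default hyperparameters already match Whisper's published front-end; loading the model with an optimized attention backend is shown only as a comment, since it downloads checkpoint weights:

```python
import numpy as np
from transformers import WhisperFeatureExtractor

# Defaults match the published Whisper front-end:
# 80 mel bins, 16 kHz audio, 30-second windows with a 10 ms hop.
extractor = WhisperFeatureExtractor()
audio = np.zeros(16_000 * 5, dtype=np.float32)  # 5 s of silence
features = extractor(audio, sampling_rate=16_000, return_tensors="np")
print(features.input_features.shape)  # (1, 80, 3000): padded to a full 30 s window

# The same checkpoints can then be loaded with an optimized attention backend:
#   from transformers import WhisperForConditionalGeneration
#   model = WhisperForConditionalGeneration.from_pretrained(
#       "openai/whisper-tiny", attn_implementation="sdpa"
#   )
```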
The Whisper family encompasses a range of models, each designed to address different computational requirements and use cases. The family begins with the Tiny model at 39 million parameters and scales through Base (74 million), Small (244 million), and Medium (769 million), culminating in the Large model with 1.55 billion parameters. This progression represents a deliberate scaling strategy, offering users flexibility in trading computational efficiency against accuracy.
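This efficiency-versus-accuracy trade-off can be made concrete with a small helper. The parameter counts below come from the Whisper paper and the checkpoint ids are those on the Hugging Face Hub; the selection function itself is a hypothetical convenience, not part of any library:

```python
# Parameter counts per the Whisper paper; ids as published on the Hugging Face Hub.
WHISPER_SIZES = {
    "openai/whisper-tiny":   39_000_000,
    "openai/whisper-base":   74_000_000,
    "openai/whisper-small":  244_000_000,
    "openai/whisper-medium": 769_000_000,
    "openai/whisper-large":  1_550_000_000,
}

def largest_under_budget(max_params: int) -> str:
    """Pick the largest (typically most accurate) checkpoint within a parameter budget."""
    fitting = {k: v for k, v in WHISPER_SIZES.items() if v <= max_params}
    if not fitting:
        raise ValueError("no checkpoint fits the budget")
    return max(fitting, key=fitting.get)

print(largest_under_budget(300_000_000))  # openai/whisper-small
```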
The evolution of the family is particularly evident in its later iterations. The Large-v2 model introduced enhanced training techniques, including 2.5 times more training epochs and improved regularization. It was followed by Large-v3 and, subsequently, the Turbo variant, which is optimized for faster transcription while maintaining high accuracy.
An important distinction within the family is the availability of both English-only and multilingual variants for the smaller models (Tiny through Medium). The English-only versions typically demonstrate superior performance in English language tasks, particularly in the smaller size categories. In contrast, the large models are exclusively multilingual, reflecting a strategic decision to focus on comprehensive language support at the higher end of the model spectrum.
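The naming convention that encodes this distinction on the Hub (a `.en` suffix for English-only checkpoints, available only up through Medium) can be captured in a small illustrative helper:

```python
def checkpoint_id(size: str, english_only: bool = False) -> str:
    """Build a Hub checkpoint id, enforcing that Large is multilingual only."""
    sizes = {"tiny", "base", "small", "medium", "large"}
    if size not in sizes:
        raise ValueError(f"unknown size: {size}")
    if english_only and size == "large":
        raise ValueError("English-only variants exist only for tiny through medium")
    suffix = ".en" if english_only else ""
    return f"openai/whisper-{size}{suffix}"

print(checkpoint_id("small", english_only=True))  # openai/whisper-small.en
```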
The Whisper family demonstrates remarkable versatility in handling multiple languages, including but not limited to English, Chinese, German, Spanish, Russian, Korean, French, Japanese, Portuguese, Turkish, and Polish. The models' performance across languages correlates strongly with the distribution of training data, with approximately one-third of the training data being non-English content.
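Language and task selection happen through the decoder's special-token prefix. The token strings below follow the format described in the Whisper paper (`<|startoftranscript|>`, a language token, a task token, and optionally `<|notimestamps|>`); the helper itself is illustrative rather than a library API:

```python
def decoder_prompt(language: str, task: str, timestamps: bool = False) -> list[str]:
    """Build the special-token prefix that conditions Whisper's decoder."""
    if task not in ("transcribe", "translate"):
        raise ValueError("task must be 'transcribe' or 'translate'")
    tokens = ["<|startoftranscript|>", f"<|{language}|>", f"<|{task}|>"]
    if not timestamps:
        # Without this token, the model interleaves timestamp tokens with text.
        tokens.append("<|notimestamps|>")
    return tokens

print(decoder_prompt("fr", "transcribe"))
# ['<|startoftranscript|>', '<|fr|>', '<|transcribe|>', '<|notimestamps|>']
```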
Performance benchmarks show impressive results, with the large model achieving a 3% Word Error Rate (WER) on the LibriSpeech test-clean benchmark. The family's zero-shot performance across diverse datasets demonstrates a 50% reduction in errors compared to contemporary models, as reported in the OpenAI Whisper Research Page. This robust performance extends to speech-to-text translation, where Whisper models outperform supervised state-of-the-art systems on CoVoST2 to English translation in zero-shot settings.
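Word error rate, the metric behind these numbers, is the word-level edit distance between reference and hypothesis divided by the reference length. A from-scratch sketch makes the definition concrete:

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level Levenshtein distance / reference word count."""
    ref, hyp = reference.split(), hypothesis.split()
    # d[i][j] = edit distance between ref[:i] and hyp[:j]
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = d[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])  # substitution
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)  # del / ins
    return d[len(ref)][len(hyp)] / max(len(ref), 1)

# One substituted word out of six: WER = 1/6
print(round(wer("the cat sat on the mat", "the cat sat on a mat"), 3))  # 0.167
```

A 3% WER thus corresponds to roughly three wrongly transcribed, dropped, or inserted words per hundred reference words.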
The Whisper family serves a wide range of applications in speech recognition and processing. Common use cases include transcription services, multilingual communication tools, accessibility features, and content creation assistance. The models' ability to handle various acoustic conditions and accents makes them particularly valuable for real-world applications where audio quality and speaking styles may vary significantly.
The flexibility of the model family is enhanced through fine-tuning capabilities, as detailed in the Fine-Tune Whisper Guide, allowing organizations to adapt the models to specific domains or requirements while maintaining the robust foundation of the pre-trained models.
Despite their impressive capabilities, the Whisper family has known limitations. The models may occasionally generate hallucinations, producing text not present in the original audio. Performance can vary across languages and accents, with better results typically observed in languages with more substantial training data representation.
The model creators emphasize ethical considerations, advising against usage without proper consent or for subjective classification tasks, particularly in high-risk domains. These limitations and guidelines are thoroughly documented in the Model Card.
The Whisper family's impact on the field of speech recognition extends beyond its immediate applications. The successful implementation of a unified model architecture for multiple speech processing tasks has influenced subsequent research and development in the field. The family's evolution from smaller, specialized models to larger, more comprehensive versions suggests a trajectory toward increasingly capable and efficient speech recognition systems.
The technical innovations introduced by the Whisper family, particularly in handling multilingual processing and diverse acoustic conditions, have set new standards for speech recognition technology. These advances continue to influence the development of new speech processing models and applications, as evidenced by the ongoing refinements and optimizations in newer versions of the model family.