Apps
Models
Pricing
Login
Compatible Apps
(1)
Audio WebUI
Generative Audio AI
Text-To-Speech • Speech-To-Text • Text-To-Music • Audio Translation
Launch a Laboratory
openai /
Whisper
The Whisper family of AI models, developed by OpenAI, represents a significant advancement in automatic speech recognition (ASR) technology. Trained on an extensive dataset of 680,000 hours of multilingual and multitask supervised data, Whisper models excel in transcribing diverse languages and dialects, even in challenging acoustic environments, without the need for fine-tuning.
Release: 2023-11-08
1 Models
Meta /
AudioCraft
AudioCraft is an open-source AI framework developed by Meta that enables high-quality music and audio generation from text prompts, utilizing models like MusicGen for text-to-music conversion, AudioGen for text-to-sound effects, and EnCodec for efficient audio processing.
Release: 2023-11-06
2 Models
suno /
Bark
The Bark family of AI models, developed by Suno, is a transformer-based text-to-audio system capable of generating highly realistic, multilingual speech, as well as music, background noises, and nonverbal sounds like laughter and sighs, all from textual prompts.
Release: 2023-04-28
1 Models
Show All 5