AI voice models

Every generation on this site uses Seed Audio 1.0 — an audio model with text-to-speech and voice cloning, 20 preset voices, multilingual output, and speed, pitch and volume controls.

Models

Generate voice from text or reference audio for your use case.

Model specs

Key Seed Audio 1.0 specs.

ModelInputOutputFormatsVoicesCloningCreditsBest for
Seed Audio 1.0Text, reference audio (up to 3)Audio fileMP3 / WAV / PCM / OGG (up to 48 kHz)20 presetsYes (30s each)Usage-based (shown upfront)Narration, TTS, voice cloning

Pick by use case

When to use Seed Audio 1.0.

Model FAQ

Common questions about Seed Audio 1.0.