Every generation on this site uses Seed Audio 1.0, an audio model with text-to-speech and voice cloning, 20 preset voices, multilingual output, and speed, pitch, and volume controls.
Generate voice from text or reference audio for your use case.
Key Seed Audio 1.0 specs.
When to use Seed Audio 1.0.
Natural, expressive text-to-speech for video and explainer narration.
Reproduce a voice from reference audio (up to 3 clips, 30s each).
Reads long scripts in a steady tone that stays consistent across chapters.
Choose voices and speed suited to a clear, steady read.
Supports English, Chinese, Japanese, Spanish, Indonesian and Portuguese, including mixed-language text.
Pick from 20 presets to match the tone and use case.
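If you prepare voice-cloning inputs in a script before uploading, a quick local check against the limits above can save a failed attempt. The following is a minimal sketch, not an official client: `ReferenceClip` and `check_reference_clips` are hypothetical names, and it assumes "30s each" means a maximum length per clip.

```python
from dataclasses import dataclass

# Limits taken from the page text above.
MAX_REFERENCE_CLIPS = 3    # "up to 3 clips"
MAX_CLIP_SECONDS = 30.0    # "30s each", read here as an upper bound (assumption)


@dataclass
class ReferenceClip:
    """Hypothetical description of one reference recording."""
    name: str
    duration_seconds: float


def check_reference_clips(clips):
    """Return a list of problems found; an empty list means the clips fit the limits."""
    problems = []
    if not clips:
        problems.append("at least one reference clip is required")
    if len(clips) > MAX_REFERENCE_CLIPS:
        problems.append(
            f"too many clips: {len(clips)} (limit {MAX_REFERENCE_CLIPS})"
        )
    for clip in clips:
        if clip.duration_seconds > MAX_CLIP_SECONDS:
            problems.append(
                f"{clip.name} runs {clip.duration_seconds:.1f}s "
                f"(limit {MAX_CLIP_SECONDS:.0f}s)"
            )
    return problems
```

Calling `check_reference_clips([ReferenceClip("intro.wav", 12.0)])` returns an empty list, while a 45-second clip or a fourth clip each produce one message describing the problem.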
Common questions about Seed Audio 1.0.