Question 1

Does it support Japanese?

Accepted Answer

Yes. It supports English, Chinese, Japanese, Spanish, Indonesian and Portuguese, generating natural speech from Japanese text.

Question 2

Can I clone my own voice?

Accepted Answer

Yes — upload reference audio (up to 3 clips, 30 seconds each) and Seed Audio reproduces that voice. Cleaner audio gives better fidelity.

Question 3

Can I adjust tone or emotion?

Accepted Answer

Describe the mood in the prompt, such as “calm tone” or “cheerful.” Combine that with the speed and pitch controls for finer results.

Question 4

What formats can I export?

Accepted Answer

MP3, WAV, PCM or OGG Opus, with sample rates up to 48 kHz, downloadable right after generation.

Question 5

How far can I change speed and pitch?

Accepted Answer

Speed 0.5–2.0x, pitch ±12 semitones, and volume 0.5–2.0x.

Question 6

Is it free to try?

Accepted Answer

New accounts get free credits to try your first generations. See the pricing page for details.

Seed Audio 1.0 features

Feature FAQ