Create & Edit Images Instantly with Grok Imagine

Try Grok Imagine
Skip to main content
Inworld Text to Speech thumbnail

Inworld Text To Speech

by Inworld
inworld-tts-1
Inworld/Text To SpeechLLMs.txt
Ultra-realistic, low-latency voice cloning supports 11 languages, instant & professional cloning, 48 kHz audio, fine emotional control, API access—ideal for dynamic, expressive AI interactions.
API PlaygroundAPI Documentation

Input

Per million characters will cost 6$

Output

Idle

Unknown content type

Related Models

Discover similar models you might be interested in

song extender
Sonauto

song extender

song inpaint
Sonauto

song inpaint

Elevenlabs Voice Changer
Eleven Labs

Elevenlabs Voice Changer

Music
Popular
Eleven Labs

Music

eleven_multilingual_v2
Popular
Eleven Labs

eleven_multilingual_v2

Baldi

Baldi

Kanye West

Song generation
Sonauto

Song generation

eleven_sound_effect
Eleven Labs

eleven_sound_effect

Qwen Text to Speech