Create & Edit Images Instantly with Grok Imagine

Try Grok Imagine
Skip to main content
Omnihuman thumbnail

Omnihuman

by Bytedance
omni-human
Omnihuman Image + Audio to VideoLLMs.txt
Generates hyper-realistic full-body videos from a single image and audio/video input using advanced diffusion transformers, supporting diverse styles, natural motion, and HD outputs up to 1024×1024 at 30fps.
API PlaygroundAPI Documentation

Input

File preview

File preview

or record audio
Audio recording is not supported in your browser

Per sec generation will cost 0.168$

Output

Idle

Unknown content type

Related Models

Discover similar models you might be interested in

Veo 3 Fast preview
Google

Veo 3 Fast preview

Kling V2.5 Turbo Image To Video
KlingAI

Kling V2.5 Turbo Image To Video

LTX 2 PRO Text To Video
ltx

LTX 2 PRO Text To Video

Hailuo 02 Image To Video
Minmax

Hailuo 02 Image To Video

Kling V1.6 Multi Image To Video
KlingAI

Kling V1.6 Multi Image To Video

Veo 3 Fast
Popular
Google

Veo 3 Fast

Seedance 1.0 Pro Image to Video
Popular
Bytedance

Seedance 1.0 Pro Image to Video

Seedance 1.0 Pro Fast Image To Video
Popular
Bytedance

Seedance 1.0 Pro Fast Image To Video

Wan 2.5 Image to Video
Alibaba

Wan 2.5 Image to Video

Sora-2
Open Ai

Sora-2

Hailuo 2.3 Image To Video
Minmax

Hailuo 2.3 Image To Video

Wan2.6 Text To Video
Alibaba

Wan2.6 Text To Video