Omnihuman

omni-human

Omnihuman Image + Audio to VideoLLMs.txt

Generates hyper-realistic full-body videos from a single image and audio/video input using advanced diffusion transformers, supporting diverse styles, natural motion, and HD outputs up to 1024×1024 at 30fps.

API Playground API Documentation

Input

Per sec generation will cost 0.168$

Output

Idle

Unknown content type

Related Models

Discover similar models you might be interested in

Google

Veo 3 Fast preview

KlingAI

Kling V2.5 Turbo Image To Video

ltx

LTX 2 PRO Text To Video

Minmax

Hailuo 02 Image To Video

KlingAI

Kling V1.6 Multi Image To Video

Popular

Google

Veo 3 Fast

Popular

Bytedance

Seedance 1.0 Pro Image to Video

Popular

Bytedance

Seedance 1.0 Pro Fast Image To Video

Alibaba

Wan 2.5 Image to Video

Open Ai

Sora-2

Minmax

Hailuo 2.3 Image To Video

Alibaba

Wan2.6 Text To Video