OmniDirector

NEW

Free Video Open-Source

LikeWebsite Promote

Key Features

Clones diverse camera motions from reference videos to animate source images.

Supports one-shot and multi-shot camera-motion transfer.

Uses a camera grid representation derived from rendered camera poses.

Injects camera grids into MMDiT alongside other control signals.

Uses a prompt expansion agent to integrate multimodal control signals.

Handles dynamic motion, scene generalization, and special camera movement.

Demonstrates coherent transitions and shot relationships across multi-shot videos.

Provides paper, public code link, and many direct demo videos.

The method represents camera motion through a camera grid rendered from reference-video camera poses in an empty 3D space. During training, this camera grid is injected into an MMDiT with other controls, while a hierarchical prompt expansion agent integrates multimodal signals at inference.

OmniDirector is useful for video generation workflows that need to copy cinematic camera language, not just object motion. It can reproduce aerial fly-throughs, descents, dolly zooms, bullet-time effects, and lens-distortion-like camera behavior while preserving generated content.

Get more likes & reach the top of search results by adding this button on your site!

OmniDirector

Key Features

Zero to AI Engineer

Subscribe to the AI Search Newsletter