The model delivers vivid, perceptually realistic behavior, capturing subtle human nuances so that transitions across complex interactive states look natural. From a single reference image, it maintains high-fidelity synthesis across diverse character styles. FlowAct-R1 comprises a training stage and an inference stage: the base full-attention DiT is first converted into a streaming autoregressive model via autoregressive adaptation, then jointly finetuned on audio and motion to improve lip-sync and body movement.
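FlowAct-R1's exact adaptation recipe is not spelled out here, but the core idea of converting full attention into a streaming autoregressive model can be illustrated with a block-causal attention mask. This is a minimal sketch under assumptions; the function name and layout below are illustrative, not from the actual model:

```python
def block_causal_mask(num_frames, tokens_per_frame):
    """Illustrative block-causal mask: tokens within one frame attend to
    each other freely, but each frame attends only to itself and to past
    frames. Returns a nested list where True means attention is allowed."""
    n = num_frames * tokens_per_frame
    frame_of = [t // tokens_per_frame for t in range(n)]  # frame id per token
    # query at frame i may attend to key at frame j iff j <= i
    return [[frame_of[q] >= frame_of[k] for k in range(n)] for q in range(n)]

# A full-attention DiT corresponds to an all-True mask; autoregressive
# adaptation replaces it with this block-causal variant, so frames can be
# generated and streamed one at a time.
mask = block_causal_mask(num_frames=3, tokens_per_frame=2)
```

With this masking, each newly generated frame only needs keys and values from earlier frames, which is what makes chunk-by-chunk streaming inference possible.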
FlowAct-R1 is highly responsive, showing strong potential for real-time, low-latency instant-communication scenarios. It is robust to varied character and motion styles and outperforms state-of-the-art methods in human preference evaluations. Because generation runs in real time and can continue for unbounded durations, interaction remains seamless, making the framework well suited to applications such as livestreaming and video conferencing.
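The text does not specify how unbounded-duration streaming is scheduled, but one common pattern is to condition each new chunk on a bounded window of recent chunks so memory stays constant regardless of rollout length. The sketch below is a hypothetical illustration of that pattern, not FlowAct-R1's actual inference loop:

```python
from collections import deque

def stream_chunks(generate_chunk, num_chunks, context_len=4):
    """Illustrative streaming rollout: each chunk is conditioned only on a
    bounded window of recent chunks, so memory use is constant and the
    rollout can, in principle, continue indefinitely."""
    context = deque(maxlen=context_len)  # bounded history
    out = []
    for step in range(num_chunks):
        chunk = generate_chunk(list(context), step)  # model call stands in here
        context.append(chunk)
        out.append(chunk)
    return out

# Toy "model": a chunk records its step and how much context it saw.
chunks = stream_chunks(lambda ctx, step: (step, len(ctx)),
                       num_chunks=6, context_len=2)
# The visible context saturates at context_len, however long the rollout runs.
```

A real system would additionally interleave audio features and enforce a latency budget per chunk, but the bounded-context structure is what makes "infinite duration" feasible.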


