Key Features

Simulates embodied egocentric worlds driven by human-motion action control.
Uses pose-associated anchor views to ground local scene customization.
Combines RGB images, 3D poses, and evolution prompts for each anchor view.
Supports localized text-driven scene evolution from a first-person perspective.
Includes hybrid-view human action control for embodied exploration.
Uses progressive training according to the project page.
Provides a paper link and Hugging Face dataset link.
Shows comparisons and demos for customized egocentric world simulation.

Each anchor view provides an RGB image, a 3D pose for spatial grounding, and an evolution prompt that localizes how the scene should change. The method combines hybrid-view human action control, evolvable anchor-view customization, and progressive training to generate personalized egocentric environments.


AnchorWorld is useful for embodied AI, game-like simulation, egocentric video generation, and interactive world-model research. The page links to a paper and dataset, while code availability is not clearly shown in the submitted page.

Get more likes & reach the top of search results by adding this button on your site!

Embed button preview - Light theme
Embed button preview - Dark theme
TurboType Banner
Zero to AI Engineer Program

Zero to AI Engineer

Skip the degree. Learn real-world AI skills used by AI researchers and engineers. Get certified in 8 weeks or less. No experience required.

Subscribe to the AI Search Newsletter

Get top updates in AI to your inbox every weekend. It's free!