Key Features

Reconstructs 3D scenes from arbitrary sparse input views.
Supports unordered captures instead of requiring a rigid input sequence.
Uses video diffusion modeling for reconstruction and novel-view synthesis.
Maintains a persistent global scene memory for long-range conditioning.
Supports long trajectories with large numbers of generated frames.
Preserves explicit geometric control during reconstruction.
Useful for casual capture, 3D scanning, and scene digitization.
Provides public code and checkpoints for research evaluation.

The system combines explicit geometric control with video diffusion modeling to improve novel-view synthesis and reconstruction consistency. It uses a persistent global scene memory built from capture views, allowing the model to condition on more than one or two frames and maintain frame-level correspondence over major viewpoint changes. This helps address the scalability limits of earlier diffusion-based reconstruction methods.


AnyRecon is valuable because it makes sparse-view reconstruction more flexible and practical. It can support long trajectories with hundreds of frames, handle unordered inputs, and produce large-scale 3D reconstructions that are better suited to real capture conditions.

Get more likes & reach the top of search results by adding this button on your site!

Embed button preview - Light theme
Embed button preview - Dark theme
TurboType Banner
Zero to AI Engineer Program

Zero to AI Engineer

Skip the degree. Learn real-world AI skills used by AI researchers and engineers. Get certified in 8 weeks or less. No experience required.

Subscribe to the AI Search Newsletter

Get top updates in AI to your inbox every weekend. It's free!