
ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs

Michal Nazarczuk, Sibi Catley-Chandar, Thomas Tanay, Zhensong Zhang, Gregory Slabaugh, Eduardo Pérez-Pellitero

2025-06-24


Summary

This paper introduces ViDAR, a new method that reconstructs detailed 4D (3D plus time) representations of dynamic scenes from a single video camera using diffusion-aware techniques.

What's the problem?

Generating accurate, high-quality 3D models that also capture motion and change over time is hard when only a single camera view is available; existing approaches often produce blurry or inconsistent results.

What's the solution?

The researchers use a diffusion-aware approach in which a video diffusion model guides the reconstruction, improving both the visual quality and the geometric accuracy of the reconstructed scenes. This lets the system better understand and recreate dynamic environments from monocular video inputs.
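To make the idea concrete, here is a minimal toy sketch of diffusion-aware supervision: renderings of the scene from novel viewpoints are passed through a diffusion model, and the enhanced outputs serve as pseudo ground truth that the reconstruction is optimized against. All function names and the simple linear stand-ins below are illustrative assumptions, not ViDAR's actual implementation or API.

```python
import numpy as np

def render_novel_view(scene_params, camera):
    # Stand-in for rendering the dynamic scene from a novel camera pose
    # (a real system would use a differentiable renderer).
    return scene_params + 0.1 * camera

def diffusion_enhance(image):
    # Stand-in for a video diffusion model that cleans up an imperfect
    # rendering; here it just damps the signal slightly.
    return 0.95 * image

def avg_loss(scene_params, cameras):
    # Average discrepancy between renderings and their enhanced versions.
    return float(np.mean([
        np.abs(render_novel_view(scene_params, c)
               - diffusion_enhance(render_novel_view(scene_params, c))).mean()
        for c in cameras
    ]))

def reconstruction_step(scene_params, cameras, lr=0.5):
    # Diffusion-enhanced renderings act as pseudo ground truth for the
    # reconstruction update.
    grad = np.zeros_like(scene_params)
    for cam in cameras:
        rendered = render_novel_view(scene_params, cam)
        target = diffusion_enhance(rendered)
        grad += rendered - target  # gradient of 0.5 * ||rendered - target||^2
    return scene_params - lr * grad / len(cameras)

rng = np.random.default_rng(0)
scene = rng.normal(size=4)               # toy scene parameters
cameras = [rng.normal(size=4) for _ in range(3)]

initial_loss = avg_loss(scene, cameras)
for _ in range(20):
    scene = reconstruction_step(scene, cameras)
final_loss = avg_loss(scene, cameras)
```

In this toy setup the discrepancy between the renderings and their enhanced targets shrinks with each step, mimicking how diffusion guidance can pull a reconstruction toward sharper, more consistent novel views.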

Why does it matter?

This matters because it helps create realistic 3D models and videos for virtual reality, gaming, and robotics using simpler setups like a single camera, making advanced 3D modeling more accessible and practical.

Abstract

ViDAR uses diffusion-aware reconstruction to generate high-quality novel views of dynamic scenes from monocular video, outperforming existing methods in visual quality and geometric consistency.