Pixel3DMM proposes a FLAME fitting optimization that solves for the 3DMM parameters from the uv-coordinate and normal estimates. This approach enables the reconstruction of posed and neutral facial geometry. The model is evaluated on a new benchmark for single-image face reconstruction, which features high diversity facial expressions, viewing angles, and ethnicities. The results show that Pixel3DMM outperforms the most competitive baselines by over 15% in terms of geometric accuracy for posed facial expressions.


Pixel3DMM has a wide range of applications in computer vision, graphics, and face analysis. Its ability to reconstruct 3D faces from single images makes it a valuable tool for face recognition, facial expression analysis, and face synthesis. The model's accuracy and robustness also make it suitable for use in various industries, such as entertainment, healthcare, and security. Additionally, Pixel3DMM's surface normal estimation and uv-coordinate prediction capabilities can be used for other tasks, such as 3D reconstruction and tracking.

Key Features

Fine-tuned DINO ViT for per-pixel surface normal and uv-coordinate prediction
Exploits latent features of the DINO foundation model
Trained on three high-quality 3D face datasets against the FLAME mesh topology
FLAME fitting optimization for 3DMM parameter estimation
Reconstructs posed and neutral facial geometry
Evaluates on a new benchmark for single-image face reconstruction
Outperforms competitive baselines in geometric accuracy
Suitable for various applications in computer vision, graphics, and face analysis

Get more likes & reach the top of search results by adding this button on your site!

Embed button preview - Light theme
Embed button preview - Dark theme

Subscribe to the AI Search Newsletter

Get top updates in AI to your inbox every weekend. It's free!