Elevate3D operates in a view-by-view manner, alternating between texture and geometry refinement. Unlike previous methods that have largely overlooked geometry refinement, Elevate3D leverages geometric cues from images refined with HFS-SDEdit by employing state-of-the-art monocular geometry predictors. This approach ensures detailed and accurate geometry that aligns seamlessly with the enhanced texture. Elevate3D outperforms recent competitors by achieving state-of-the-art quality in 3D model refinement.
Elevate3D's key contribution is its ability to refine 3D models by alternating between texture and geometry refinement, creating high-quality assets with well-aligned features. HFS-SDEdit enables the diffusion model to freely generate high-quality low-frequency features, shifting the image generation away from the input's low-quality domain. Simultaneously, HFS-SDEdit injects high-frequency components from the input to preserve its identity and crucial details without inheriting low-quality artifacts.