CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting
Kornel Howil, Joanna Waczyńska, Piotr Borycki, Tadeusz Dziarmaga, Marcin Mazur, Przemysław Spurek
2025-05-30
Summary
This paper talks about CLIPGaussian, a new AI tool that can change the style of not just pictures and videos, but also 3D objects and even entire scenes, using either text descriptions or example images as a guide.
What's the problem?
The problem is that most style transfer methods only work for simple images and can't handle more complex things like 3D models or scenes that change over time, and they often can't easily use both text and images to guide the style.
What's the solution?
The researchers created CLIPGaussian, which uses a special way of representing visuals called Gaussian splatting. This lets the system directly adjust the colors and shapes of objects in 2D, 3D, and even 4D (where things change over time) based on what you describe in text or show in a reference image.
Why it matters?
This is important because it opens up new creative possibilities for artists, filmmakers, and game designers, making it much easier to apply unique styles to all kinds of media using simple instructions or examples.
Abstract
CLIPGaussians is a style transfer framework that supports text- and image-guided stylization of 2D images, videos, 3D objects, and 4D scenes by optimizing color and geometry directly on Gaussian primitives.