DeContext as Defense: Safe Image Editing in Diffusion Transformers
Linghui Shen, Mingyue Cui, Xingyi Yang
2025-12-19
Summary
This paper focuses on protecting personal images from being secretly altered by powerful AI image-editing tools, specifically a class of models called in-context diffusion models.
What's the problem?
These new AI tools are very good at editing images from simple instructions, but that also means someone could take a picture of you and alter it to make it look like you did or said something you never did, or even use it to impersonate you. Previous methods for protecting images don't work well against these newer, more capable AI systems.
What's the solution?
The researchers developed a technique called DeContext. It works by adding small, barely visible changes to the original image that disrupt how the AI 'pays attention' to it when making edits. Think of it like quietly jamming the connection between the source image and the edited output, so the AI can no longer reliably use the original as a reference. They also found that this connection is carried mostly by the early denoising steps and by specific transformer blocks inside the model, so concentrating the changes there makes the method even more effective.
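To make the idea concrete, here is a minimal, hedged sketch of how a protection like this could be set up: a PGD-style loop that adds a small perturbation (bounded in L-infinity norm) to the image while pushing down a loss that measures how strongly the image's content can be "attended to" as context. This is not the authors' code; attention_coupling_loss below is a hypothetical stand-in (a toy patch-attention score) so the loop runs end to end, whereas the real defense would read this signal from the editing model's multimodal attention layers.

```python
# Sketch only: PGD-style perturbation under an L-infinity budget.
import torch
import torch.nn.functional as F

def attention_coupling_loss(image: torch.Tensor) -> torch.Tensor:
    # Toy surrogate: treat 8x8 patches as tokens and measure how strongly a fixed
    # "context" token attends to them. A real defense would take this signal from
    # the editing model's attention layers instead.
    tokens = F.unfold(image, kernel_size=8, stride=8).transpose(1, 2)  # (B, N, C*64)
    context = torch.ones_like(tokens[:, :1, :])                        # placeholder context token
    attn = torch.softmax(tokens @ context.transpose(1, 2) / tokens.shape[-1] ** 0.5, dim=1)
    return attn.max(dim=1).values.mean()  # high when the context locks onto the image

def decontext_pgd(image, steps=50, eps=8 / 255, alpha=1 / 255):
    """Return image + delta with ||delta||_inf <= eps that lowers the coupling loss."""
    delta = torch.zeros_like(image, requires_grad=True)
    for _ in range(steps):
        loss = attention_coupling_loss((image + delta).clamp(0, 1))
        loss.backward()
        with torch.no_grad():
            delta -= alpha * delta.grad.sign()  # descend: weaken the attention pathway
            delta.clamp_(-eps, eps)             # keep the change barely visible
            delta.grad.zero_()
    return (image + delta).clamp(0, 1).detach()

protected = decontext_pgd(torch.rand(1, 3, 64, 64))
```

The budget eps and step size alpha here are generic adversarial-perturbation defaults, not values reported in the paper.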
Why it matters?
This research is important because it provides a way to defend against the misuse of AI image editing technology. It helps protect people's privacy and prevents the creation of fake or misleading images that could harm individuals or spread misinformation. It shows a practical way to make these powerful AI tools safer to use.
Abstract
In-context diffusion models allow users to modify images with remarkable ease and realism. However, the same power raises serious privacy concerns: personal images can be easily manipulated for identity impersonation, misinformation, or other malicious uses, all without the owner's consent. While prior work has explored input perturbations to protect against misuse in personalized text-to-image generation, the robustness of modern, large-scale in-context DiT-based models remains largely unexamined. In this paper, we propose DeContext, a new method to safeguard input images from unauthorized in-context editing. Our key insight is that contextual information from the source image propagates to the output primarily through multimodal attention layers. By injecting small, targeted perturbations that weaken these cross-attention pathways, DeContext breaks this flow and effectively decouples the link between input and output. This simple defense is both efficient and robust. We further show that early denoising steps and specific transformer blocks dominate context propagation, which allows us to concentrate perturbations where they matter most. Experiments on Flux Kontext and Step1X-Edit show that DeContext consistently blocks unwanted image edits while preserving visual quality. These results highlight the effectiveness of attention-based perturbations as a powerful defense against image manipulation.
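The abstract's second observation, that context propagation is dominated by early denoising steps and specific transformer blocks, suggests aggregating the attention mass flowing from source-image context tokens only over those steps and blocks. The snippet below is an illustrative sketch of that aggregation under stated assumptions: ToyMMDiTBlock, the block indices in target_blocks, and the early_steps cutoff are placeholders, not the paper's architecture or settings.

```python
# Illustration: restrict the "context attention mass" to selected blocks and early steps.
import torch
import torch.nn as nn

class ToyMMDiTBlock(nn.Module):
    """Stand-in for a multimodal DiT block that exposes its attention weights."""
    def __init__(self, dim=32, heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, out_tokens, ctx_tokens):
        # Output tokens query a sequence that also contains the source-image context tokens.
        kv = torch.cat([out_tokens, ctx_tokens], dim=1)
        y, weights = self.attn(out_tokens, kv, kv, need_weights=True)
        return y, weights  # weights: (B, N_out, N_out + N_ctx)

def context_attention_mass(blocks, out_tokens, ctx_tokens,
                           target_blocks=(0, 1), early_steps=(0, 1, 2), num_steps=10):
    """Average attention that output tokens pay to context tokens, restricted to
    the selected blocks and the early portion of the denoising schedule."""
    mass = 0.0
    for t in range(num_steps):
        if t not in early_steps:
            continue  # later steps are assumed to carry little context flow
        x = out_tokens  # in a real model, x would be the noisy latent at step t
        for i, block in enumerate(blocks):
            x, w = block(x, ctx_tokens)
            if i in target_blocks:
                mass = mass + w[..., out_tokens.shape[1]:].mean()  # mass on context columns
    return mass

blocks = nn.ModuleList([ToyMMDiTBlock() for _ in range(4)])
out_tokens, ctx_tokens = torch.randn(1, 16, 32), torch.randn(1, 16, 32)
print(context_attention_mass(blocks, out_tokens, ctx_tokens))
```

In a defense like DeContext, a quantity of this kind would serve as the objective that the image perturbation minimizes, so that less of the source image's information reaches the edited output.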