AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery
Haowei Wang, Junjie Wang, Xiaojun Jia, Rupeng Zhang, Mingyang Li, Zhe Liu, Yang Liu, Qing Wang
2025-05-28
Summary
This paper talks about AdInject, a new way to attack web agents powered by vision-language models by using online ads to sneak in harmful content and trick the AI into doing things it shouldn't.
What's the problem?
The problem is that web agents, which use AI to interact with websites and help automate tasks, are much more vulnerable to these kinds of attacks than regular chatbots. This is because they pay attention to both what they see and what they read, and attackers can use ads or pop-ups to deliver hidden instructions or misleading information that the AI might follow.
What's the solution?
To show how serious this issue is, the researchers created AdInject, which uses real internet ads to secretly inject malicious content into the web pages that these agents visit. They tested this method and found that it can easily fool the agents, proving that current security measures are not strong enough.
Why it matters?
This matters because as more companies use AI web agents for important jobs, these vulnerabilities could be abused by hackers to cause real harm or steal information. The research shows that we need better ways to protect AI systems that work online, especially those that interact with ads and other dynamic web content.
Abstract
AdInject is a novel real-world black-box attack method leveraging internet advertising to inject malicious content into vision-language model-based web agents, demonstrating significant vulnerability in web agent security.