Vulnerability  ·  2026-06-13

StakeBench: No Production Web Agent Consistently Blocks Prompt Injection — Direct Attacks Succeed 79%+ Across GPT-5 and Gemini in 3,168 Simulations

VulnerabilityHigh impactGlobal
Researchers from Nanyang Technological University, ST Engineering, IBM Research, and the University of Illinois Urbana-Champaign published StakeBench (arXiv 2606.13385) on June 12, 2026 — the first stakeholder-centric prompt injection benchmark. Testing NanoBrowser and BrowserUse agents with GPT-5 and Gemini 2.5-Flash across 3,168 attack simulations, they found zero attack objectives were reliably resisted. Direct injection succeeded in more than 79% of configurations; indirect attacks succeeded 41.67–68.16%. The study documents a 'stealthy parasitism' failure mode where agents complete users' tasks while simultaneously executing attacker objectives — invisible to conventional detection.
Adversarial instructions embedded in web page content (product reviews, comments, page text) that AI web agents encounter during task execution. Stealthy parasitism attacks are particularly dangerous because the user sees correct output while the attacker's objective (e.g., biased product recommendations, credential harvesting) is simultaneously achieved.
All production web agents tested — those powered by GPT-5 and Gemini 2.5-Flash using NanoBrowser and BrowserUse frameworks. The paper's findings apply broadly to any LLM agent that browses the web to complete tasks on behalf of users.
No complete mitigation exists. Recommended controls: implement stakeholder-aware harm monitoring that tracks multi-party impact (not just task success); treat prompt injection as a distribution of harm requiring contextual assessment; apply agent output monitoring for asymmetric behaviour patterns indicative of stealthy parasitism.
Sources
arXiv 2606.13385 — Who Pays the Price? Stakeholder-Centric Prompt Injection Benchmarking for Real-World Web AgentsCSO Online — Prompt injection breaks today's AI agents, study warns
See this in the live feed Explore related AI security and governance findings — updated every morning.
Open the feed →