Prompt injection

Definition

An attack where a malicious instruction is hidden inside text that an AI reads — such as a document, email, or web page — tricking the AI into ignoring its original instructions and doing what the attacker wants instead. Think of it as the AI equivalent of forging a memo from the CEO and slipping it into an employee's inbox. The AI cannot reliably tell the difference between legitimate instructions from its operators and forged ones from an attacker.

Why it matters

Any AI that reads or summarises external content — customer emails, web pages, uploaded documents — is a potential target. A successful attack can cause the AI to leak confidential data, take unauthorised actions, or spread misinformation, all without the user or operator realising it.

Findings on this topic (60)

Strands Agents Tools — Credential Exfiltration via LLM-Steerable http_request Proxy Routing Microsoft Copilot for Word — Cross-Domain Prompt Injection Enables Self-Propagating AI Worm Across Documents Microsoft Azure DevOps MCP Server - Hidden PR Comments Hijack AI Review Agents (Confused Deputy)Qualys ships TotalAI GA — continuous AI governance across shadow GenAI, MCP servers, and agentic workloads Microsoft Defender adds pre-delivery prompt injection protection for email inbox (public preview)Microsoft Defender adds runtime threat detection (preview) and real-time protection (GA) for Microsoft Agent 365 agents Google Agent Development Kit (ADK) — Tool Confirmation Continuation Forgery Enables Unauthorized Tool Execution NoteGen AI Chat App — Stored XSS via Unsanitized Markdown Rendering of Model Responses (Tauri privileged webview)Kimi Code (Moonshot AI) — SSRF via Incomplete FetchURL Denylist Bypassed by DNS Rebinding / Redirect Google MCP Toolbox for Databases — SSRF and Credential Exfiltration via Unvalidated FHIR Page-Fetch Tool nolabs launches 'nono' — open-source kernel-enforced sandboxing runtime for AI agents and MCP workloads BlenderMCP Path Traversal via Poly Haven Asset Download Enables Arbitrary File Write (CVE-2026-66004)Suna Agentic Platform — Broken Access Control in Message Queue API Exposes Cross-User Prompt Data (CVE-2026-66027)Void AI Code Editor — Path Traversal in Agent File-Reading Tools Enables Silent Credential Exfiltration BlenderMCP — Path Traversal in Polyhaven Asset Download via MITM/Prompt Injection Suna Agentic AI Platform — Broken Access Control Allows Prompt Injection Into Other Users' Running Agent Sessions Box unveils AI agent security and governance controls (guardrails, prompt-injection detection, MCP guardrails)Akamai Demonstrates End-to-End Precision Prompt Injection Kill Chain Against Deployed Travel-Booking AI Agent ISO/IEC 27090 reaches FDIS (Final Draft International Standard) stage — AI security threats guidance Lineation.ai publicly launches Zero Trust runtime security control plane for AI agents Agentic-Flow MCP Server OS Command Injection (CVE-2026-58195)Lineation.ai — Public Launch of Zero Trust Runtime Security Control Plane for Autonomous AI Agents OpenAI GPT-Red Automated Red-Teamer Demonstrates Live Prompt-Injection Compromise of Deployed Agentic Vending-Machine System LiteLLM MCP Server Creation — Command Injection via Unvalidated JSON Config (STDIO Transport)Red Hat details layered agent sandboxing combining OpenShift Sandboxed Containers (Kata) with NVIDIA OpenShell Claude Desktop "PromptFiction" — Zero-Click Prompt Injection via claude:// URI Scheme OpenAI GPT-Red — Automated Adversarial Self-Play Red-Teamer for Prompt Injection AWS Security Hub Adds AI Workload Protection (GuardDuty AI Protection GA + AI Inventory) and Azure Multicloud Support PraisonAI AgentMail Webhook Signature Verification Bypass Enables Message Spoofing PraisonAI AICoder Arbitrary File Write and Command Execution via LLM Tool Calls PraisonAI CodeAgent — Unrestricted LLM-Generated Python Execution Enables RCE Claude Code Adds Sandboxed Built-In Browser with Safety Classifiers for Agentic Web Actions PraisonAI — Prompt Injection Defense Bypass via Detector-Family Threshold Gaming (CVSS 5.3)Langroid SQLChatAgent — Regex Blocklist Bypass Enables Prompt-to-SQL-to-RCE (CVSS 8.7)Langroid Neo4jChatAgent — Unvalidated LLM-Generated Cypher Enables Graph Data Destruction and Potential RCE (CVSS 9.2)Langroid TableChatAgent/VectorStore — Sandbox Escape to Remote Code Execution via Incomplete eval() Mitigation (CVSS 10.0)HalluSquatting — Adversarial Hallucination Squatting Enables Botnet Distribution via AI Coding Assistants Friendly Fire: Prompt Injection Hijacks Claude Code and Codex CLI for RCE During Defensive Code Review CrowdStrike Expands Prompt Injection Taxonomy to 200+ Techniques GitLost: Prompt Injection in GitHub Agentic Workflows Leaks Private Repository Data via Public Issues Prompt Injection Campaigns Trick Browsing AI Agents Into Sending Cryptocurrency Payments OpenAI Codex Desktop (macOS) — Indirect Prompt Injection via Markdown Image Rendering Enables Data Exfiltration AI Agent Poisoning via SEO Poisoning and Hidden HTML Prompt Injection — Agents Tricked into Fraudulent Crypto Payments Netzilo AIDR — Portable Runtime Governance for AI Agents Across Bedrock AgentCore, Copilot Studio, Vertex AI, LangGraph, CrewAI Kong Konnect MCP Server — Indirect Prompt Injection Enables Unintended API Request Execution (CVE-2026-13341)Cursor IDE 'DuneSlide' — Zero-Click Prompt Injection Escapes Sandbox for OS-Level RCE (CVE-2026-50548 / CVE-2026-50549)Nightfall AI: AI-Native DLP + MCP Security Platform with Prompt Injection Detection for Agentic Workflows DuneSlide — Cursor IDE Zero-Click Prompt Injection Sandbox Escape Enables Host RCE (CVE-2026-50548, CVE-2026-50549)System Card: Claude Sonnet 5 Snyk Evo Agentic Development Security (ADS) — Runtime Governance for AI Coding Agents, MCP Servers, and Generated Code Straiker — $64M Series A + STAR Labs Agentic Exploit Dataset; GA Platform: Agent Discovery, Pre-Deployment Red-Teaming, Runtime Protection Microsoft Defender for Endpoint: AI Agent Runtime Protection (Public Preview) — Auto-Discovery of 25+ Local Agent Types + Prompt Injection Blocking LiteLLM AI Gateway — Three-CVE RCE Chain: Default Internal User Can Escalate to Admin and Execute Arbitrary Code DeepTutor — MCP Tool Authorization Bypass Allows Low-Privilege Users to Invoke Any Configured MCP Tool F5 AI Security Platform GA + SurePath AI Acquisition — Network-Level AI Discovery, Shadow AI Detection, Runtime Guardrails University of Washington Study: Agentic AI Browsers Allow Same-Origin Policy Bypass via Prompt Injection, Working PoC Demonstrated Microsoft Defender for Endpoint: Discovery of 25+ Local AI Agent Types + MCP Server Runtime Protection Against Prompt Injection Prompt Injection Now Confirmed in Production AI Deployments — Three Enterprise Breaches Disclosed (June 2026)Agentic Red-Team Tools (12 Systems) — Systemic Sandbox Escape and API Key Exfiltration via Agent-Phishing (arXiv 2606.24496)Claude Code — Sandbox Escape via Git Worktree Path Confusion Enables Host Code Execution

References

OWASP Top 10 for LLM Applications — LLM01: Prompt Injection NIST CSRC Glossary: Prompt Injection

Track this in the live feed See how this plays out in real AI security and governance developments.

Open the feed →

Definition

Why it matters

Related terms

Demonstrated by recent findings

Findings on this topic (60)

References