Question 1

What is Agentjacking?

Accepted Answer

An attack in which an adversary hijacks a running AI agent — taking control of its actions mid-task — by feeding it malicious instructions through the data or tools it interacts with. The attacker essentially 'steers' the agent to execute harmful commands (run malware, exfiltrate data) as if those commands came from the legitimate operator.

Question 2

Why does Agentjacking matter for AI security?

Accepted Answer

Because agents have broad system access — developer credentials, code repositories, cloud infrastructure — a successful agentjacking can be as damaging as a full network intrusion. It was documented in production enterprise environments in 2026.

Agentjacking

Definition

Why it matters

References

Agentjacking

Definition

Why it matters

Demonstrated by recent findings

Findings on this topic (4)

References