technicals

What is prompt injection?

June 2, 2026 · 4 min read

PROMPT INJECTIONIt obeys whatever slips through the slot.A stranger's note, tucked inside the owner's letter, gets followed.hidden noteowner's letterthe AI (clerk)obeys the strangerThe model can't tell trusted instructions from text smuggled in with the data.

Definition

Prompt injection is an attack that smuggles hidden instructions into an AI assistant’s input so it ignores its real job and does what the attacker wants instead.

At a glance

Why your business should care

If you connect an AI assistant to your email, files, or customer data, a single poisoned message or document can hijack it. In 2025, prompt-injection incidents leaked chat records, login credentials, and confidential files from tools linked to ChatGPT.[3] The AI was working as designed, which is exactly the problem.

How attackers pull it off

They hide commands where your AI will read them, like white text in a webpage, a note in an email, or instructions in a shared document. The AI treats that planted text as a legitimate order.[2] Stanford student Kevin Liu famously used Ignore previous instructions to make Bing Chat reveal its secret internal rules.[4]

Bottom line

Treat any text your AI reads as a potential instruction from a stranger, and never connect AI tools to sensitive systems without limits and human review.

Connects to Computer ScienceLaw

References

  1. LLM01:2025 Prompt Injection - OWASP Gen AI Security Project. OWASP Foundation genai.owasp.org
  2. What Is a Prompt Injection Attack? IBM www.ibm.com
  3. Prompt Injection: An Analysis of Recent LLM Security Incidents. NSFOCUS nsfocusglobal.com
  4. Prompt Injection | OWASP Foundation. OWASP Foundation owasp.org