AI Security // Offensive AI Operations

LLM and agent systems as a live attack surface.

Model-backed products, agent systems and retrieval pipelines treated as a live offensive surface.

6 notesred-team focusmodel abuse

Domain overview

AI security is useful only when it moves beyond novelty. This domain focuses on prompt injection, tool misuse, retrieval abuse, memory contamination, model APIs and operational red teaming.

Good AI assessment work combines application security, API review, auth logic, cloud exposure and workflow abuse with model-specific pressure. Prompt injection, context poisoning, output steering, authority confusion, unsafe tool invocation, retrieval exfiltration and agent compromise are all just different ways of asking whether language can seize control of automation.

Primary operator questions

  • Can untrusted content override or reshape the hidden instruction hierarchy?
  • Can a retrieved document, email, web page or ticket poison the model's planning path?
  • Can the assistant call tools, query data stores or send actions with more authority than it should?
  • Can model output be trusted by code, analysts or business workflows without verification?
  • Can the system be pushed from harmless chat into data exposure, lateral movement or destructive action?

Red-team pressure lines

Useful pressure usually follows five lanes. First, instruction attacks: direct prompt injection, indirect prompt injection, jailbreak chaining and system prompt leakage. Second, retrieval attacks: poisoned corpora, malicious documents, embedded instructions and confidence laundering through RAG. Third, agent abuse: unauthorized tool use, connector overreach, action replay and confirmation bypass. Fourth, API and inference weaknesses: weak auth, file-handling mistakes, quota abuse, plugin boundaries and tenant leakage. Fifth, reporting discipline: proving whether the behavior is reachable, repeatable and tied to real business impact.

Related certification context

These certifications are not the point of the domain, but they are useful orientation anchors for operators who want a formal practice path beside the field notes.

Selected public references

Topic index

brief

AI Attack Surface Primer

Model-backed attack paths across prompts, retrieval, orchestration and tool invocation.

surface maptrust boundaries
brief

Prompt Injection & Jailbreaks

Model-backed attack paths across prompts, retrieval, orchestration and tool invocation.

prompt supply chainpolicy bypass
brief

RAG, Agents & Tool Abuse

Model-backed attack paths across prompts, retrieval, orchestration and tool invocation.

ragagent abuse
brief

AI Red Teaming Methodology

Model-backed attack paths across prompts, retrieval, orchestration and tool invocation.

workflowreporting
brief

LLM Pentesting Note

Industrial protocols, engineering trust and process-level exposure in operational environments.

cross-linkspecialist note