Praxen

OWASP Gen AI Security — the frameworks Praxen uses

Every finding Praxen produces is tagged against industry-standard OWASP frameworks so the result lands in language your security team already speaks. This page explains where those frameworks come from and gives a one-line gloss on each risk so you can read a tag without leaving the report.

📊 See it live: the OWASP Coverage Report aggregates LLM and Agentic Top-10 coverage across Praxen's entire example suite — a browsable map of which risks each target carries, with a link into every per-target analysis. Rendered on GitHub Pages.


OWASP, briefly

The Open Worldwide Application Security Project (owasp.org) is the non-profit foundation behind the vendor-neutral, openly-licensed "Top 10" risk lists that are a baseline reference across application security.

OWASP Gen AI Security Project

As LLM-based systems moved into production, OWASP spun up a dedicated effort: the OWASP Gen AI Security Project (genai.owasp.org). It maintains the AI-specific guidance Praxen relies on — currently three documents:

Document What it covers Praxen tag prefix
OWASP Top 10 for LLM Applications 2025 Risks in applications built on large language models LLM01LLM10
OWASP Top 10 for Agentic AI Applications 2026 Risks specific to autonomous, tool-using agents ASI01ASI10
A Practical Guide for Secure MCP Server Development 2026 Securing Model Context Protocol servers and the tools they expose mcp (checklist items)

Praxen carries distilled extracts of all three in its knowledge base (skills/behavior-verifier/knowledge/), but the canonical, full-length versions live at the links above — go there for the complete write-ups, examples, and references.


OWASP Top 10 for LLM Applications 2025

The risk landscape for any system that puts an LLM in the loop. Each finding Praxen tags with one of these traces to a behavior or code pattern in the agent's evidence.

ID Risk One-line gloss
LLM01 Prompt Injection Untrusted input (direct or smuggled in via external content) overrides the model's instructions.
LLM02 Sensitive Information Disclosure The model leaks PII, secrets, or proprietary data through its outputs.
LLM03 Supply Chain Compromised or untrusted models, datasets, plugins, or dependencies enter the system.
LLM04 Data and Model Poisoning Training, fine-tuning, or RAG data is manipulated to bias or backdoor the model.
LLM05 Improper Output Handling Model output is passed downstream (shell, SQL, HTML, eval) without validation.
LLM06 Excessive Agency The model is granted more capability, permission, or autonomy than the task needs.
LLM07 System Prompt Leakage The system prompt — and any secrets or logic baked into it — is exposed.
LLM08 Vector and Embedding Weaknesses Flaws in embeddings or vector stores enable injection, poisoning, or data leakage via RAG.
LLM09 Misinformation The model produces confident, plausible, wrong output that users act on.
LLM10 Unbounded Consumption Unconstrained inference (cost, compute, rate) enables denial-of-wallet or denial-of-service.

→ Full document: OWASP Top 10 for LLM Applications 2025


OWASP Top 10 for Agentic AI Applications 2026

Risks that emerge once an LLM is wired to tools, memory, other agents, and the ability to act. This is the list that matters most for the kind of autonomous, tool-using agents Praxen is built to verify.

ID Risk One-line gloss
ASI01 Agent Goal Hijack An attacker redirects the agent's objective so it pursues their goal instead of yours.
ASI02 Tool Misuse and Exploitation The agent is steered into using its legitimate tools for illegitimate ends.
ASI03 Identity and Privilege Abuse The agent's identity, tokens, or delegated permissions are abused or over-broad.
ASI04 Agentic Supply Chain Vulnerabilities Malicious or compromised plugins, MCP servers, skills, or sub-agents enter the agent's stack.
ASI05 Unexpected Code Execution (RCE) The agent runs attacker-influenced code on the host or in connected systems.
ASI06 Memory and Context Poisoning Persistent memory or retrieved context is seeded with content that corrupts future behavior.
ASI07 Insecure Inter-Agent Communication Messages between agents are unauthenticated, unvalidated, or trusted blindly.
ASI08 Cascading Failures One bad action or output propagates through a chain of agents/tools and amplifies.
ASI09 Human-Agent Trust Exploitation The agent's perceived authority is used to manipulate the humans who rely on it (or vice versa).
ASI10 Rogue Agents An agent operates outside its remit — compromised, misconfigured, or deliberately planted.

→ Full document: OWASP Top 10 for Agentic AI Applications 2026


A Practical Guide for Secure MCP Server Development 2026

The Model Context Protocol (MCP) is how many agents discover and call external tools. MCP servers are unusual: they run with delegated user permissions, support dynamic tool loading, and can chain tool calls — so a single weakness amplifies. When Praxen finds an MCP configuration in the evidence (.mcp.json, mcp.json, or similar), it applies the guide's minimum-bar checklist across these areas:

  • Architecture — local (STDIO / loopback) vs. remote (TLS, authenticated) binding; session isolation between users and agents.
  • Tool design — least-privilege scopes; no destructive tools without confirmation; clear, non-deceptive tool descriptions.
  • Input/output validation — schema-validated arguments; sanitized outputs; no raw passthrough.
  • Prompt-injection controls — tool descriptions and returned content treated as untrusted, not as instructions.
  • Authentication & authorization — OAuth 2.1 / OIDC for remote servers; scoped, short-lived tokens; no token passthrough to downstream APIs.
  • Secrets & deployment — no credentials in config files, env files, or tool descriptions; non-root, sandboxed execution; pinned, scanned dependencies.
  • Governance & audit — tool invocations logged; a review process for new or updated tools; MCP-server identity tracked separately from the agent's.

Any "No" against the checklist is a finding; secrets in an MCP config file are a Critical.

→ Full document: A Practical Guide for Secure MCP Server Development 2026


How these tags surface in a report

A finding's primary OWASP classification appears two ways in the HTML report:

  • On the finding card itself, as a labeled tag (LLM01 — Prompt Injection, ASI05 — Unexpected Code Execution (RCE), …) that links to that entry in this page.
  • In the OWASP LLM Top 10 Coverage and OWASP Agentic Top 10 Coverage grid sections — full-bleed 5×2 cards, one per category, each populated card showing the top-three most-severe findings as clickable chips. Empty cells render "No findings" so the grid reads as a coverage map: at a glance you see both what risks the agent has and which categories the analysis did not surface.

The grids are driven by each finding's owasp_llm / owasp_agentic primary scalar; secondaries listed in a finding's tags[] array still appear on the finding card. See Interpreting Reports §9–§10 for the grid layout details. For the same coverage view aggregated across every target in Praxen's example suite, browse the live OWASP Coverage Report.


How this fits with RAISE

The OWASP frameworks above answer "what kind of risk is this finding?" The RAISE Framework answers "how mature is this agent's security posture overall?" — a six-category 0–5 score. Every Praxen finding carries both: a RAISE category tag and (where applicable) an OWASP LLM, OWASP Agentic, or MCP tag. See Interpreting Reports for how the tags appear on a finding card.


See also

Edit this page on GitHub ↗