OWASP Gen AI Security — the frameworks Praxen uses
Every finding Praxen produces is tagged against industry-standard OWASP frameworks so the result lands in language your security team already speaks. This page explains where those frameworks come from and gives a one-line gloss on each risk so you can read a tag without leaving the report.
📊 See it live: the OWASP Coverage Report aggregates LLM and Agentic Top-10 coverage across Praxen's entire example suite — a browsable map of which risks each target carries, with a link into every per-target analysis. Rendered on GitHub Pages.
OWASP, briefly
The Open Worldwide Application Security Project (owasp.org) is the non-profit foundation behind the vendor-neutral, openly-licensed "Top 10" risk lists that are a baseline reference across application security.
OWASP Gen AI Security Project
As LLM-based systems moved into production, OWASP spun up a dedicated effort: the OWASP Gen AI Security Project (genai.owasp.org). It maintains the AI-specific guidance Praxen relies on — currently three documents:
| Document | What it covers | Praxen tag prefix |
|---|---|---|
| OWASP Top 10 for LLM Applications 2025 | Risks in applications built on large language models | LLM01–LLM10 |
| OWASP Top 10 for Agentic AI Applications 2026 | Risks specific to autonomous, tool-using agents | ASI01–ASI10 |
| A Practical Guide for Secure MCP Server Development 2026 | Securing Model Context Protocol servers and the tools they expose | mcp (checklist items) |
Praxen carries distilled extracts of all three in its knowledge base (skills/behavior-verifier/knowledge/), but the canonical, full-length versions live at the links above — go there for the complete write-ups, examples, and references.
OWASP Top 10 for LLM Applications 2025
The risk landscape for any system that puts an LLM in the loop. Each finding Praxen tags with one of these traces to a behavior or code pattern in the agent's evidence.
→ Full document: OWASP Top 10 for LLM Applications 2025
OWASP Top 10 for Agentic AI Applications 2026
Risks that emerge once an LLM is wired to tools, memory, other agents, and the ability to act. This is the list that matters most for the kind of autonomous, tool-using agents Praxen is built to verify.
→ Full document: OWASP Top 10 for Agentic AI Applications 2026
A Practical Guide for Secure MCP Server Development 2026
The Model Context Protocol (MCP) is how many agents discover and call external tools. MCP servers are unusual: they run with delegated user permissions, support dynamic tool loading, and can chain tool calls — so a single weakness amplifies. When Praxen finds an MCP configuration in the evidence (.mcp.json, mcp.json, or similar), it applies the guide's minimum-bar checklist across these areas:
- Architecture — local (STDIO / loopback) vs. remote (TLS, authenticated) binding; session isolation between users and agents.
- Tool design — least-privilege scopes; no destructive tools without confirmation; clear, non-deceptive tool descriptions.
- Input/output validation — schema-validated arguments; sanitized outputs; no raw passthrough.
- Prompt-injection controls — tool descriptions and returned content treated as untrusted, not as instructions.
- Authentication & authorization — OAuth 2.1 / OIDC for remote servers; scoped, short-lived tokens; no token passthrough to downstream APIs.
- Secrets & deployment — no credentials in config files, env files, or tool descriptions; non-root, sandboxed execution; pinned, scanned dependencies.
- Governance & audit — tool invocations logged; a review process for new or updated tools; MCP-server identity tracked separately from the agent's.
Any "No" against the checklist is a finding; secrets in an MCP config file are a Critical.
→ Full document: A Practical Guide for Secure MCP Server Development 2026
How these tags surface in a report
A finding's primary OWASP classification appears two ways in the HTML report:
- On the finding card itself, as a labeled tag (
LLM01 — Prompt Injection,ASI05 — Unexpected Code Execution (RCE), …) that links to that entry in this page. - In the OWASP LLM Top 10 Coverage and OWASP Agentic Top 10 Coverage grid sections — full-bleed 5×2 cards, one per category, each populated card showing the top-three most-severe findings as clickable chips. Empty cells render "No findings" so the grid reads as a coverage map: at a glance you see both what risks the agent has and which categories the analysis did not surface.
The grids are driven by each finding's owasp_llm / owasp_agentic primary scalar; secondaries listed in a finding's tags[] array still appear on the finding card. See Interpreting Reports §9–§10 for the grid layout details. For the same coverage view aggregated across every target in Praxen's example suite, browse the live OWASP Coverage Report.
How this fits with RAISE
The OWASP frameworks above answer "what kind of risk is this finding?" The RAISE Framework answers "how mature is this agent's security posture overall?" — a six-category 0–5 score. Every Praxen finding carries both: a RAISE category tag and (where applicable) an OWASP LLM, OWASP Agentic, or MCP tag. See Interpreting Reports for how the tags appear on a finding card.
See also
- The RAISE Framework — the maturity-scoring side of the analysis
- Interpreting Reports — where these tags show up in a Praxen report
- Challenging and Revising Findings — what to do when you disagree with a finding
