Book demo
Sample report

AgentSec

Red-team assessment for high-privilege autonomous agents

Proof

Live sample

Sample assessment — illustrative data only. Not a live red-team environment.

Security score 58
Risk type medium
Tests 12
Vulnerable 4

Sample AgentSec assessment of a high-privilege autonomous agent. Red-team probes identified tool-output poisoning and insufficient input validation on web-search result handling. Static audit flagged broad filesystem read scope in SOUL configuration.

Benchmark lanes

Injection resistance 62
Tool boundary enforcement 48
Static configuration hygiene 71

Findings

Tool output poisoning via search result High

Adversarial content embedded in a simulated web-search result caused the agent to suggest reading sensitive configuration files.

ASI06
Over-broad read_file scope Medium

Static audit found SOUL instructions allowing reads outside the intended workspace directory.

ASI07
Progressive elicitation partial success Low

Multi-turn elicitation extracted internal tool names but did not achieve credential disclosure.

ASI01

OWASP Agentic ASI

ASI01 Agent Goal Hijack 3 findings · max LOW
ASI06 Memory & Context Poisoning 2 findings · max HIGH
ASI07 Insecure Inter-Agent Communication 1 findings · max MEDIUM

Video walkthrough

AgentSec demo Demo

Product demo — not a live environment.

Process

How it works

  1. 1
    Start session

    Target a high-privilege agent (victim) in an isolated environment.

  2. 2
    Static & credential scan

    Audit SOUL, config, and exposed endpoints.

  3. 3
    Red-team probes

    Injection, tool-output poisoning, progressive elicitation.

  4. 4
    Benchmark report

    OWASP Agentic ASI mapping, score, and hardening recommendations.

Details

Capabilities

  • Progressive elicitation and injection testing
  • Static audit of agent configuration and tools
  • Session-based adversarial probing
  • Benchmark-style scoring and narrative report

MCP tools

  • agentsec_start_session
  • agentsec_generate_report

Ready to see ClawDesk in action?