Reference Implementation Submission: Agent Trust Bench -- live adversarial payment-flow test corpus (138 profiles, OWASP LLM Top-10 aligned)

Submitting the Agent Trust Bench as a reference implementation / community testing resource for the OWASP Top 10 for LLM Applications working group.

**Agent Trust Bench** -- https://agent-trust-bench.algovoi.co.uk
**Docs** -- https://docs.algovoi.co.uk/agent-trust-bench

**Scope:** Live adversarial test bench for AI agents that handle x402/A2A payment flows -- the agentic-payment surface currently underrepresented in OWASP LLM test corpora. 138 profiles across 30 threat categories, OWASP LLM Top-10 aligned.

**Category coverage:**

| OWASP | Profiles (count) |
|---|---|
| LLM01 Prompt Injection | injection, jailbreak-meta, replay, jailbreak-sequence (18) |
| LLM06 Sensitive Info Disclosure | credential exposure, key-leak probes (8) |
| LLM07 Insecure Plugin Design | malformed schema, structural placement, amount formatting (16) |
| LLM08 Excessive Agency | orchestrator-auth bypass, capability-inject, commitment-forcing (14) |
| LLM09 Overreliance | authority spoofing, urgency, mismatch, currency drift (22) |
| LLM02 Insecure Output Handling | response validation, output injection (6) |
| LLM05 Supply Chain | dependency confusion, tampered tool response (4) |
| FATF | sanctioned counterparty, jurisdiction override, velocity (12) |

**Distinctive properties:**
- Live endpoint, not a static dataset -- agents make real HTTP calls against adversarial payment challenges
- Fake-signed, safe replay -- $1 USDC cap enforced server-side, 30-day responsible-disclosure window
- Three reference personas (safe / aggressive / unbounded) for cross-LLM comparison
- Machine-readable discovery for CI integration: `GET https://agent-trust-bench.algovoi.co.uk/.well-known/x402.json`
- Per-profile event log -- failures are debuggable, not just numeric

MIT-licensed runner. No account required.

Happy to answer questions from the working group or contribute profiles to the ASI taxonomy mapping effort.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Reference Implementation Submission: Agent Trust Bench -- live adversarial payment-flow test corpus (138 profiles, OWASP LLM Top-10 aligned) #839

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

OWASP	Profiles (count)
LLM01 Prompt Injection	injection, jailbreak-meta, replay, jailbreak-sequence (18)
LLM06 Sensitive Info Disclosure	credential exposure, key-leak probes (8)
LLM07 Insecure Plugin Design	malformed schema, structural placement, amount formatting (16)
LLM08 Excessive Agency	orchestrator-auth bypass, capability-inject, commitment-forcing (14)
LLM09 Overreliance	authority spoofing, urgency, mismatch, currency drift (22)
LLM02 Insecure Output Handling	response validation, output injection (6)
LLM05 Supply Chain	dependency confusion, tampered tool response (4)
FATF	sanctioned counterparty, jurisdiction override, velocity (12)

Uh oh!

Reference Implementation Submission: Agent Trust Bench -- live adversarial payment-flow test corpus (138 profiles, OWASP LLM Top-10 aligned) #839

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions