How It Works

Sage intercepts tool calls made by AI agents, extracts security-relevant artifacts, and checks them against multiple threat detection layers.

Detection Layers

URL reputation - Cloud-based lookup for malware, phishing, and scam URLs. Works without an API key.
Local heuristics - YAML-based regex patterns matching dangerous commands, suspicious URLs, sensitive file paths, credential exposure, and obfuscation techniques.
Package supply-chain checks - Registry existence, file reputation, and age analysis for npm/PyPI packages. See Package Protection.
Plugin scanning - Scans other installed plugins for threats at session start. See Plugin Scanning.

Intercepted Tools

Platform	Hooks / Tools
Claude Code	`PreToolUse` on `Bash`, `WebFetch`, `Write`, `Edit`, `Read`
Cursor	`beforeShellExecution`, `preToolUse` (Write, Edit, Delete, WebFetch), `beforeMCPExecution`, `beforeReadFile`
VS Code	`PreToolUse` on `Bash`, `WebFetch`, `Write`, `Edit`, `Read`, `Delete`
OpenClaw	`exec`, `web_fetch`, `write`, `edit`, `read`, `apply_patch`

Data Flow

Tool call received
  │
  ├─ Extract artifacts (URLs, commands, file paths, content)
  │
  ├─ Check allowlist → if allowlisted → allow
  │
  ├─ Check cache → if cached → use cached verdict
  │
  ├─ Run local heuristics (pattern matching against threat definitions)
  │
  ├─ Query URL reputation (for extracted URLs)
  │
  ├─ Check packages (for install commands / manifest writes)
  │
  ├─ Decision engine combines all signals → verdict
  │
  ├─ Cache result
  │
  └─ Audit log → return verdict

Verdicts

Decision	Meaning
`allow`	No threats detected — tool call proceeds
`ask`	Suspicious — user presented with approval dialog
`deny`	Confirmed threat — tool call blocked

When multiple signals fire, merge precedence is: deny > ask > allow.

Fail-Open Design

Sage is designed to never break the agent. Every internal error path returns an allow verdict. Extension hooks (Cursor / VS Code) always exit with code 0; the host uses the JSON response to decide whether to block the tool call. If the URL reputation API is down or times out, Sage falls back to heuristics only.

Sensitivity Presets

The confidence threshold determines when a detection escalates from ask to deny:

Preset	Threshold	Behavior
`paranoid`	0.70	Blocks on any suspicion
`balanced`	0.85	Blocks confirmed threats, warns on suspicious (default)
`relaxed`	0.95	Only blocks high-confidence malware

Configure in ~/.sage/config.json with "sensitivity": "paranoid".

On text-based connectors (OpenCode), paranoid mode also promotes all ask verdicts to deny. OpenCode relies on the agent to relay approval prompts, which is susceptible to prompt-injection auto-approval. Claude Code, Cursor, and OpenClaw use native approval dialogs and are unaffected.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How It Works

Detection Layers

Intercepted Tools

Data Flow

Verdicts

Fail-Open Design

Sensitivity Presets

FilesExpand file tree

how-it-works.md

Latest commit

History

how-it-works.md

File metadata and controls

How It Works

Detection Layers

Intercepted Tools

Data Flow

Verdicts

Fail-Open Design

Sensitivity Presets