GitHub - m4p1e/agent-sentinel: AgentSentinel: An End-to-End and Real-Time Security Defense Framework for Computer-Use Agents

Environment Requirement

Our test environment configuration as follows.

Ubuntu 20.04
Docker version 27.2.1
LLVM-12 (required by ebpf-go)
libbpf-dev 1:0.5.0-1~ubuntu20.04.1
Enable BPF-LSM. Check cat /sys/kernel/security/lsm to ensure it is enabled.
Golang: go1.23.3 linux/amd64
Python 3.11

Build Test Environment

Compile AgentSentinel

Download (by git clone git@github.com:m4p1e/agent-sentinel.git) and move this entire project to $GOPATH.

# install go dependency
go mod download

# (optional) manually compile eBPF program
go generate ./...

# compile
go build

You will get a binary named agent-sentinel.

Build Computer-Use Agent

cd tests/claude_computer_use_demo_test/
docker build . -t computer-use-demo-test

Ensure that no image name collsion for computer-use-demo-test.

Build Mock Server

cd tests/scripts/attack_mock_server
docker build . -t computer-use-demo-test-attack-server

Ensure that no image name collsion for computer-use-demo-test-attack-server.

Create Basic Mount Directory

For mounting the monitoring server socket file.

mkdir ~/.anthropic

Configure SUDO Permission for Run AgentSentinel

Add $project_dir/tests/scripts/start_sandbox.sh and $project_dir/tests/scripts/stop_sandbox.sh to /etc/sudoers. To make sure that AgentSentinel can run with ROOT permission.

For example

sudo cat /etc/sudoers
# ...
# your_username ALL = NOPASSWD:$project_dir/tests/scripts/start_sandbox.sh
# your_username ALL = NOPASSWD:$project_dir/tests/scripts/stop_sandbox.sh

Set API Key

Add related envrionmenet variables.

# for claude models
export ANTHROPIC_API_KEY=xxx
# for openai models
export OPENAI_API_KEY=xxx

Modify these envrionmenet variables in tests/scripts/start_sandbox.sh and tests/scripts/start_tool_use_guard.sh.

Modify api_key in following basic config files:

tests/scripts/claude35_basic_config.json
tests/scripts/claude37_basic_config.json
tests/scripts/gpt_4_turbo_basic_config.json
tests/scripts/gpt_4o_basic_config.json

Install Require Python Package for Run Test Scripts

pip install docker

Patch some File Paths

For tests/scripts/run_single_task.py:

- HOST_MOUNT_DIR = "~/.anthropic"
+ HOST_MOUNT_DIR = "absolute_path_of_your_home_dir/.anthropic"

- "../dataset/attacks/attack4agent2/attack4agent2.json" : 300
+ "absolute_path_of_agentsentinel_dir/tests/dataset/attacks/attack4agent2/attack4agent2.json" : 300

For tests/scripts/start_sandbox.sh and tests/scripts/start_tool_use_guard.sh:

- -mount-path=~/.anthropic \
+ -mount-path=absolute_path_of_your_home_dir/.anthropic \

Run Test

Ensure that above environment configuration is done.

First, move workdir to cd tests/scripts.

Try for a test:

python run_single_task.py example_basic_config_normal.json example_task_config.json

To run a experiment:

# run direct task injection attacks, with agent (claude 3.5 sonnet), no defense
python run_all_tasks.py -c claude35_basic_config.json -t all_attack_task.json -a direct_task_inject

# run direct task injection attacks, with agent (gpt-4o), no defense
python run_all_tasks.py -c gpt_4o_basic_config.json -t all_attack_task.json -a direct_task_inject

# run direct task injection attacks, with agent (claude 3.5 sonnet), with AgentSentinel defense
python run_all_tasks.py -c claude35_basic_config.json -t all_attack_task.json -a direct_task_inject -d sandbox

# ...

More experiment commands can be seen in run_all_tasks.py.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
api		api
audit		audit
client/python		client/python
ebpf		ebpf
handler		handler
helper		helper
log		log
tests		tests
tracer		tracer
.gitignore		.gitignore
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Environment Requirement

Build Test Environment

Compile AgentSentinel

Build Computer-Use Agent

Build Mock Server

Create Basic Mount Directory

Configure SUDO Permission for Run AgentSentinel

Set API Key

Install Require Python Package for Run Test Scripts

Patch some File Paths

Run Test

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Environment Requirement

Build Test Environment

Compile AgentSentinel

Build Computer-Use Agent

Build Mock Server

Create Basic Mount Directory

Configure SUDO Permission for Run AgentSentinel

Set API Key

Install Require Python Package for Run Test Scripts

Patch some File Paths

Run Test

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages