Robust Transfer of Safety-Constrained Reinforcement Learning Agents

This repository provides code and instructions for training agents under action disturbances for a safe and robust transfer between environments with different dynamics, preventing safety violations after the transfer.

Installation

This code builds up on OmniSafe.

Clone omnisafe:

git clone https://github.com/PKU-Alignment/omnisafe

Clone this repository:

git clone https://github.com/ai-fm/safe-and-robust-transfer

Add the source code from this project to omnisafe:

Copy the files in ./safe-and-robust-transfer/src/algorithms/ into ./omnisafe/omnisafe/algorithms/, and include these algorithms in ./omnisafe/omnisafe/algorithms/__init__.py.
Copy the files in ./safe-and-robust-transfer/src/configs/ into ./omnisafe/omnisafe/configs/
Copy the files in ./safe-and-robust-transfer/src/envs/ into ./omnisafe/omnisafe/envs/.

Done! Now the project can be installed with

cd omnisafe
pip install -e .

Please take a look at the OmniSafe installation instructions for more details.

Running

All scripts are located in src/scripts/:

Training the guides

src/scripts/train/train_guides.py trains the guides.

Use DDPGNoise to train a guide with random noise.
Use DDPGAdversarial to train a guide with adversarial perturbations.
Use SACLag to train a guide with entropy maximization.

Environment options are SafetyPointGuide1-v0, SafetyPointGuide2-v0, and SafetyPointGuide3-v0.

Training the students

src/scripts/train/train_students.py trains the students.

Use SaGuiCS if the guide is nondeterministic (SAC).
Use SaGuiCSDet if the guide is deterministic (DDPG).

Environment options are SafetyPointStudent1-v0, SafetyPointStudent2-v0, and SafetyPointStudent3-v0.

Robustness

src/scripts/robustness measures the robustness of an agent.

Make sure to provide:

A config.json file.
A torch_save/{MODEL_FNAME} file, where MODEL_FNAME usually looks like epoch-XXX.pt.

License

This code is licensed under the terms of the Apache License.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
overview.svg		overview.svg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Robust Transfer of Safety-Constrained Reinforcement Learning Agents

Installation

Running

Training the guides

Training the students

Robustness

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Robust Transfer of Safety-Constrained Reinforcement Learning Agents

Installation

Running

Training the guides

Training the students

Robustness

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages