EED Gym baseline and ablation checkpoints

This repository contains RL baseline and ablation study checkpoints for the Empathic Ethical Disobedience (EED) Gym environment.

The repository contains 40 zipped checkpoint files in total.
Each baseline folder corresponds to one trained baseline and contains 5 .zip files (one per seed):
- Vanilla PPO
- PPO-LSTM
- Masked PPO
- Lagrangian PPO
The ablations folder contains 4 subfolders with checkpoints for the ablations described in the paper, namely:
- no affect ablation
- no clarify/alternative ablation
- no curriculum ablation
- no trust penalty ablation

Usage

Download with git lfs or via direct file links, e.g.:

wget https://huggingface.co/inq-android/eedgym-ckpts/resolve/main/ppo_vanilla/vanilla_seed0.zip
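
To fetch several files without retyping the full link, the direct-link pattern above can be wrapped in a small helper. This is a minimal sketch, not part of the repository: `ckpt_url` and `fetch_ckpt` are hypothetical helper names, and the only file name confirmed here is `ppo_vanilla/vanilla_seed0.zip`. Other folder and file names should be checked against the repository's file listing before use.

```shell
#!/bin/sh
# Base of the direct-download links, taken from the wget example above.
BASE_URL="https://huggingface.co/inq-android/eedgym-ckpts/resolve/main"

# ckpt_url FOLDER FILE
# Print the direct link for one checkpoint file (hypothetical helper).
ckpt_url() {
    printf '%s/%s/%s\n' "$BASE_URL" "$1" "$2"
}

# fetch_ckpt FOLDER FILE
# Download one checkpoint into ./FOLDER/FILE, creating the folder if needed.
fetch_ckpt() {
    mkdir -p "$1" && wget -O "$1/$2" "$(ckpt_url "$1" "$2")"
}

# Example call (uncomment to download; this is the one confirmed file name):
# fetch_ckpt ppo_vanilla vanilla_seed0.zip

# To get everything at once instead, cloning with git lfs should work,
# assuming the repository root matches the link above:
# git lfs install && git clone https://huggingface.co/inq-android/eedgym-ckpts
```

Keeping the URL construction separate from the download makes it easy to print and inspect a link before fetching it.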

Video Preview