PPO on RBC2D-hard-v0 (FluidGym)

This repository is part of the FluidGym benchmark results. It contains trained Stable Baselines3 agents for the specialized RBC2D-hard-v0 environment.

Evaluation Results

Global Performance (Aggregated across 5 seeds)

Mean Reward: -0.46 ± 0.12

Per-Seed Statistics

Run Mean Reward Std Dev
Seed 0 -0.44 2.50
Seed 1 -0.24 2.47
Seed 2 -0.57 2.78
Seed 3 -0.58 2.58
Seed 4 -0.48 2.58

About FluidGym

FluidGym is a benchmark for reinforcement learning in active flow control.

Usage

Each seed is contained in its own subdirectory. You can load a model using:

from stable_baselines3 import PPO
model = PPO.load("0/ckpt_latest.zip")

References

Downloads last month
-
Video Preview
loading

Collection including safe-autonomous-systems/ma-ppo-RBC2D-hard-v0

Paper for safe-autonomous-systems/ma-ppo-RBC2D-hard-v0

Evaluation results