PPO on Airfoil2D-medium-v0 (FluidGym)

This repository is part of the FluidGym benchmark results. It contains trained Stable Baselines3 agents for the specialized Airfoil2D-medium-v0 environment.

Evaluation Results

Global Performance (Aggregated across 5 seeds)

Mean Reward: 3.13 ± 0.07

Per-Seed Statistics

Run Mean Reward Std Dev
Seed 0 3.18 0.41
Seed 1 3.24 0.39
Seed 2 3.11 0.37
Seed 3 3.02 0.40
Seed 4 3.11 0.40

About FluidGym

FluidGym is a benchmark for reinforcement learning in active flow control.

Usage

Each seed is contained in its own subdirectory. You can load a model using:

from stable_baselines3 import PPO
model = PPO.load("0/ckpt_latest.zip")

Important: The models were trained using fluidgym==0.0.2. In order to use them with newer versions of FluidGym, you need to wrap the environment with a FlattenObservation wrapper as shown below:

import fluidgym
from fluidgym.wrappers import FlattenObservation
from stable_baselines3 import PPO

env = fluidgym.make("Airfoil2D-medium-v0")
env = FlattenObservation(env)
model = PPO.load("path_to_model/ckpt_latest.zip")

obs, info = env.reset(seed=42)

action, _ = model.predict(obs, deterministic=True)
obs, reward, terminated, truncated, info = env.step(action)

References

Downloads last month
149
Video Preview
loading

Collection including safe-autonomous-systems/ppo-Airfoil2D-medium-v0

Paper for safe-autonomous-systems/ppo-Airfoil2D-medium-v0

Evaluation results