Reinforcement Learning
stable-baselines3
PandaReachDense-v3
a2c
panda-gym
deep-rl-class
Eval Results (legacy)
Instructions to use jnforja/a2c-PandaReachDense-v3 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- stable-baselines3
How to use jnforja/a2c-PandaReachDense-v3 with stable-baselines3:
from huggingface_sb3 import load_from_hub checkpoint = load_from_hub( repo_id="jnforja/a2c-PandaReachDense-v3", filename="{MODEL FILENAME}.zip", ) - Notebooks
- Google Colab
- Kaggle
A2C Agent Playing PandaReachDense-v3
This agent was trained locally with Stable-Baselines3 A2C using the Hugging Face Deep RL Course Unit 6 setup.
Results
- Mean reward:
-0.19 +/- 0.12 - Evaluation episodes:
10 - Timesteps:
1000000
Hyperparameters
env_id: PandaReachDense-v3
repo_id: jnforja/a2c-PandaReachDense-v3
model_name: a2c-PandaReachDense-v3
seed: 42
n_envs: 4
total_timesteps: 1000000
policy: MultiInputPolicy
model_architecture: A2C
norm_obs: true
norm_reward: true
clip_obs: 10.0
eval_episodes: 10
video_episodes: 1
min_video_seconds: 3
Files
a2c-PandaReachDense-v3.zip: Stable-Baselines3 A2C checkpoint.vec_normalize.pkl: VecNormalize observation/reward statistics.replay.mp4: rendered preview episode.results.json: evaluation output.
- Downloads last month
- 13
Evaluation results
- mean_reward on PandaReachDense-v3self-reported-0.19 +/- 0.12