RyanAA
/

ppo-SnowballTarget

Reinforcement Learning

ML-Agents-SnowballTarget

Model card Files Files and versions

Metrics Training metrics Community

RyanAA commited on May 15

Commit

978f2cb

·

verified ·

1 Parent(s): 1f5948d

Updated README.md

Files changed (1) hide show

README.md +33 -23

README.md CHANGED Viewed

@@ -1,32 +1,42 @@
-%%writefile README.md
-# PPO SnowballTarget Agent
-This model was trained using Proximal Policy Optimization (PPO) with Unity ML-Agents as part of the Hugging Face Deep Reinforcement Learning Course.
-## Environment
-- Unity ML-Agents
-- SnowballTarget environment
-## Training Details
-- Algorithm: PPO
-- Total training steps: 200,000
-- Final mean reward: ~23.2
-## Results
-The agent learned to consistently hit targets in the SnowballTarget environment and achieved stable rewards during training.
-Final training logs:
-- Step 160000 → Mean Reward: 22.84
-- Step 170000 → Mean Reward: 22.85
-- Step 180000 → Mean Reward: 23.00
-- Step 190000 → Mean Reward: 23.46
-- Step 200000 → Mean Reward: 23.21
-## Files
-- `SnowballTarget.onnx` — trained Unity ML-Agents policy network
 ## Usage
-This model can be loaded into Unity ML-Agents for inference and evaluation.
-## Author
-Ryan Aparicio

+---
+tags:
+- reinforcement-learning
+- ml-agents
+- ppo
+- unity
+- SnowballTarget
+license: mit
+---
+# PPO SnowballTarget
+This is a trained PPO agent playing SnowballTarget using Unity ML-Agents.
+## Environment
+SnowballTarget
+## Algorithm
+PPO (Proximal Policy Optimization)
+## Training Results
+Final mean reward: ~23.2 after 200k training steps.
 ## Usage
+You can watch the agent play directly in your browser:
+1. Go to:
+https://huggingface.co/spaces/ThomasSimonini/ML-Agents-SnowballTarget
+2. In the model selector, enter:
+RyanAA/ppo-SnowballTarget
+3. Select `SnowballTarget.onnx`
+4. Click "Watch the agent play"
+## Files
+- `SnowballTarget.onnx` — trained policy network