hamzasheedi commited on
Commit
7911548
·
verified ·
1 Parent(s): 2b88805

Upload 4 files

Browse files
Files changed (4) hide show
  1. README.md +68 -0
  2. gitattributes +35 -0
  3. metadata.json +7 -0
  4. model.zip +3 -0
README.md ADDED
@@ -0,0 +1,68 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - deep-reinforcement-learning
4
+ - reinforcement-learning
5
+ - stable-baselines3
6
+ - BipedalWalker-v3
7
+ - PPO
8
+ - SAC
9
+ library_name: stable-baselines3
10
+ model_name: ppo
11
+ ---
12
+
13
+ # 🤖 PPO/SAC Agent for BipedalWalker-v3
14
+
15
+ This is a trained agent that learned to walk on two legs from scratch!
16
+
17
+ ## Model Description
18
+
19
+ - **Algorithm**: PPO or SAC (Soft Actor-Critic)
20
+ - **Environment**: BipedalWalker-v3
21
+ - **Framework**: Stable-Baselines3
22
+ - **Training Steps**: 500,000 steps
23
+
24
+ ## Performance
25
+
26
+ - **Walking Success**: Consistent bipedal locomotion
27
+ - **Average Reward**: 200+ (successful walking)
28
+ - **Coordination**: Learned proper leg coordination and balance
29
+
30
+ ## Usage
31
+
32
+ ```python
33
+ from stable_baselines3 import PPO
34
+ import gymnasium as gym
35
+
36
+ # Load the trained model
37
+ model = PPO.load("bipedal_walker_ppo_model")
38
+
39
+ # Create environment
40
+ env = gym.make('BipedalWalker-v3', render_mode='human')
41
+
42
+ # Watch it walk!
43
+ obs, _ = env.reset()
44
+ for _ in range(2000):
45
+ action, _ = model.predict(obs, deterministic=True)
46
+ obs, reward, terminated, truncated, info = env.step(action)
47
+ if terminated or truncated:
48
+ obs, _ = env.reset()
49
+
50
+ env.close()
51
+ ```
52
+
53
+ ## Training Details
54
+
55
+ The agent learned to coordinate:
56
+ - 4 continuous joint controls (hip + knee for each leg)
57
+ - Balance and momentum management
58
+ - Forward locomotion
59
+ - Obstacle navigation
60
+
61
+ ## What Makes This Impressive
62
+
63
+ - **24-dimensional state space** - Complex sensory input
64
+ - **Continuous control** - Smooth joint movements
65
+ - **Physics simulation** - Realistic walking dynamics
66
+ - **From scratch learning** - No pre-programmed walking patterns
67
+
68
+ Amazing to watch a robot learn to walk! 🚶‍♂️
gitattributes ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
metadata.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "algorithm": "PPO",
3
+ "environment": "BipedalWalker-v3",
4
+ "framework": "stable-baselines3",
5
+ "upload_date": "2025-09-20T20:10:01.676730",
6
+ "description": "Learned bipedal locomotion from scratch"
7
+ }
model.zip ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0c851d12caf37f6c6d2001266e5d6ee9745975bc82435cad6b750573515c5482
3
+ size 3292414