winkin119 commited on
Commit
7c75dfc
·
verified ·
1 Parent(s): b56cb34

upload via upload_folder 2025-07-26T20:09:33.953403+00:00

Browse files
Files changed (5) hide show
  1. README.md +35 -0
  2. eval_result.json +6 -0
  3. params.json +41 -0
  4. replay.mp4 +0 -0
  5. sac_pendulum.pth +3 -0
README.md ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ env_name: Pendulum-v1
3
+ tags:
4
+ - Pendulum-v1
5
+ - sac
6
+ - reinforcement-learning
7
+ - custom-implementation
8
+ - SAC
9
+ - Pendulum
10
+ model-index:
11
+ - name: SAC-PendulumV1
12
+ results:
13
+ - task:
14
+ type: reinforcement-learning
15
+ name: reinforcement-learning
16
+ dataset:
17
+ name: Pendulum-v1
18
+ type: Pendulum-v1
19
+ metrics:
20
+ - type: mean_reward
21
+ value: -129.63 +/- 63.60
22
+ name: mean_reward
23
+ verified: false
24
+ ---
25
+
26
+ # **SAC** Agent playing **Pendulum-v1**
27
+ This is a trained model of a **SAC** agent playing **Pendulum-v1**.
28
+
29
+ ## Usage
30
+
31
+ model = load_from_hub(repo_id="winkin119/SAC-PendulumV1", filename="sac_pendulum.pth")
32
+
33
+
34
+ env = gym.make("Pendulum-v1")
35
+ ...
eval_result.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "mean_reward": -129.63262623823556,
3
+ "std_reward": 63.59632357260991,
4
+ "datetime": "2025-07-26T19:43:29.520758+00:00",
5
+ "train_duration_min": "1.27"
6
+ }
params.json ADDED
@@ -0,0 +1,41 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "env_config": {
3
+ "env_id": "Pendulum-v1",
4
+ "env_kwargs": {},
5
+ "max_steps": null,
6
+ "use_image": false,
7
+ "vector_env_num": 6,
8
+ "use_multi_processing": true,
9
+ "image_shape": null,
10
+ "frame_stack": 1,
11
+ "frame_skip": 1,
12
+ "training_render_mode": null
13
+ },
14
+ "device": "cpu",
15
+ "learning_rate": 0.0003,
16
+ "gamma": 0.99,
17
+ "checkpoint_pathname": "",
18
+ "max_grad_norm": 0.5,
19
+ "log_interval": 100,
20
+ "eval_episodes": 50,
21
+ "eval_random_seed": 42,
22
+ "eval_video_num": 10,
23
+ "total_steps": 120000,
24
+ "hidden_sizes": [
25
+ 128,
26
+ 128
27
+ ],
28
+ "use_layer_norm": true,
29
+ "critic_lr": 0.0003,
30
+ "replay_buffer_capacity": 96000,
31
+ "batch_size": 128,
32
+ "update_start_step": 10000,
33
+ "alpha": 0.2,
34
+ "auto_tune_alpha": true,
35
+ "alpha_lr": 0.0003,
36
+ "target_entropy": -1.0,
37
+ "tau": 0.005,
38
+ "max_action": 2.0,
39
+ "log_std_min": -20,
40
+ "log_std_max": 2
41
+ }
replay.mp4 ADDED
Binary file (27.9 kB). View file
 
sac_pendulum.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fb09c57c3b134e8c76147a7e95f2c44cd60fe0acd0f67ea16c658786a814d744
3
+ size 77597