Commit a0fa3e3 (verified), committed by A5omic
1 Parent(s): 0c9a4ab

Reupload with model files and replay

README.md ADDED
@@ -0,0 +1,37 @@
+ ---
+ library_name: stable-baselines3
+ tags:
+ - PongNoFrameskip-v4
+ - deep-reinforcement-learning
+ - reinforcement-learning
+ - stable-baselines3
+ model-index:
+ - name: DQN
+   results:
+   - task:
+       type: reinforcement-learning
+       name: reinforcement-learning
+     dataset:
+       name: PongNoFrameskip-v4
+       type: PongNoFrameskip-v4
+     metrics:
+     - type: mean_reward
+       value: -21.00 +/- 0.00
+       name: mean_reward
+       verified: false
+ ---
+
+ # **DQN** Agent playing **PongNoFrameskip-v4**
+ This is a trained model of a **DQN** agent playing **PongNoFrameskip-v4**
+ using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+
+ ## Usage (with Stable-baselines3)
+ TODO: Add your code
+
+
+ ```python
+ from stable_baselines3 import ...
+ from huggingface_sb3 import load_from_hub
+
+ ...
+ ```
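The usage block in the committed README is still the template placeholder. As a sketch of how it might be filled in, the helper below downloads `pong-dqn.zip` (the archive added in this commit) and loads it with `DQN.load`. The helper name `load_pong_dqn` is hypothetical, and `repo_id` is left as a parameter because the repository name does not appear in this diff. The imports are deferred so the function can be defined even when `stable-baselines3` and `huggingface_sb3` are not installed.

```python
def load_pong_dqn(repo_id: str, filename: str = "pong-dqn.zip"):
    """Download the checkpoint from the Hugging Face Hub and load it as a DQN model.

    repo_id is an assumption (for example "<user>/<repo>"); it is not shown in this commit.
    """
    # Deferred imports: both packages are only needed when the function is called.
    from huggingface_sb3 import load_from_hub
    from stable_baselines3 import DQN

    # load_from_hub returns the local path of the downloaded file.
    checkpoint_path = load_from_hub(repo_id=repo_id, filename=filename)
    return DQN.load(checkpoint_path)
```

The returned model's `predict` method expects observations preprocessed the same way as during training. Atari preprocessing and frame stacking are typical for this environment, but the wrappers actually used are not visible in this diff.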
config.json ADDED
The diff for this file is too large to render. See raw diff
 
pong-dqn.zip ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4044817a01615e261bf2803928001f510421afacbb1797743cfd11f79f9cbc2a
+ size 13754324
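The three added lines above are a Git LFS pointer, not the archive itself: they record the spec version, the SHA-256 of the real file, and its size in bytes. A minimal sketch of reading such a pointer and checking a downloaded file against it follows; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any library.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its version, hash algorithm, digest, and size."""
    # Each pointer line is "<key> <value>", separated by a single space.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "oid_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and SHA-256 the pointer records."""
    data = path.read_bytes()
    return len(data) == pointer["size"] and hashlib.sha256(data).hexdigest() == pointer["oid"]


# Pointer content for pong-dqn.zip, copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:4044817a01615e261bf2803928001f510421afacbb1797743cfd11f79f9cbc2a
size 13754324
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["oid_algo"], pointer["size"])
```

With the archive downloaded locally, `matches_pointer(Path("pong-dqn.zip"), pointer)` would confirm that the file matches what this commit references.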
pong-dqn/_stable_baselines3_version ADDED
@@ -0,0 +1 @@
+ 2.6.0
pong-dqn/data ADDED
The diff for this file is too large to render. See raw diff
 
pong-dqn/policy.optimizer.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:215ea7d8898faa9284464c7109532cf36390d330fccab8d77eeba20628a32876
+ size 1120
pong-dqn/policy.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a122bf9fa86a63401646a464328a308529f69e579242d817a62b432603dafc8a
3
+ size 13505178
pong-dqn/pytorch_variables.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0c35cea3b2e60fb5e7e162d3592df775cd400e575a31c72f359fb9e654ab00c5
3
+ size 864
pong-dqn/system_info.txt ADDED
@@ -0,0 +1,9 @@
+ - OS: Linux-6.1.123+-x86_64-with-glibc2.35 # 1 SMP PREEMPT_DYNAMIC Sun Mar 30 16:01:29 UTC 2025
+ - Python: 3.11.12
+ - Stable-Baselines3: 2.6.0
+ - PyTorch: 2.6.0+cu124
+ - GPU Enabled: False
+ - Numpy: 2.0.2
+ - Cloudpickle: 3.1.1
+ - Gymnasium: 1.1.1
+ - OpenAI Gym: 0.25.2
results.json ADDED
@@ -0,0 +1 @@
+ {"mean_reward": -21.0, "std_reward": 0.0, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2025-05-01T00:00:18.628583"}
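The `mean_reward` value in the README metadata (`-21.00 +/- 0.00`) appears to be derived from this file. A small sketch of that formatting, using only the JSON shown above, is given here; the variable names are illustrative. Note that -21 is the lowest score a Pong episode can end with, so a mean of -21.0 with zero spread over 10 episodes means the agent lost every point in each evaluation episode.

```python
import json

# results.json content, copied from the diff above.
RESULTS_JSON = (
    '{"mean_reward": -21.0, "std_reward": 0.0, "is_deterministic": false, '
    '"n_eval_episodes": 10, "eval_datetime": "2025-05-01T00:00:18.628583"}'
)

results = json.loads(RESULTS_JSON)

# Format mean and standard deviation the way the model card metadata shows them.
metric = f"{results['mean_reward']:.2f} +/- {results['std_reward']:.2f}"
print(metric)  # -21.00 +/- 0.00
```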