Attila Kanto
committed on
Commit · 73a98b8
1 Parent(s): c9a7658
Update README.md: add 25M steps checkpoint and adjust training timesteps
README.md
CHANGED
@@ -28,10 +28,10 @@ This is a PPO agent trained using Stable Baselines3 and Gymnasium on a Mario-lik
 
 ## Training Timesteps & Checkpoints
 
-| Checkpoint
-|
-| [
-| [50M Steps](checkpoints/simple/50M_steps/mario_ppo.zip)
+| Checkpoint | Timesteps | Notes |
+| ---------------------------------------------------------------- | ---------- | -------------------- |
+| [25M Steps](checkpoints/simple/25M_steps/mario_ppo_25000000.zip) | 25,000,000 | Early-stage learning |
+| [50M Steps](checkpoints/simple/50M_steps/mario_ppo.zip) | 50,000,000 | Better stability |
 
 ## Usage
 
checkpoints/simple/25M_steps/mario_ppo_25000000.zip
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:63e083df68672c3ee1a4aa14cffe2de546305e2c61fa0b7c6ad34b20db578786
+size 289910411
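
The added file is a Git LFS pointer, not the checkpoint itself: it records only the spec version, a SHA-256 object ID, and the size in bytes. As a minimal sketch (not part of this commit), the pointer could be used to check that a locally downloaded checkpoint matches what was committed. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the sketch assumes the pointer uses the simple `key value` line layout shown in the diff above.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file has the size and digest recorded in the pointer."""
    path = Path(path)
    # Cheap check first: a size mismatch avoids hashing a large file.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as f:
        # Hash in 1 MiB chunks so the whole checkpoint is never held in memory.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:63e083df68672c3ee1a4aa14cffe2de546305e2c61fa0b7c6ad34b20db578786
size 289910411
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
```

After fetching the real file (for example with `git lfs pull`), calling `matches_pointer("checkpoints/simple/25M_steps/mario_ppo_25000000.zip", pointer)` should return `True` if the download is intact. The size comparison runs first so that a truncated download is rejected without hashing the file.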