Update README.md
Browse files
README.md
CHANGED
|
@@ -36,15 +36,17 @@ REINFORCE is a policy gradient method that:
|
|
| 36 |
|
| 37 |
|
| 38 |
## Something To Say
|
| 39 |
-
- Reach PLE (0.0.1) through trial and error with SSH Key on (https://github.com/ntasfi/PyGame-Learning-Environment)
|
| 40 |
|
| 41 |
-
-
|
| 42 |
|
| 43 |
-
-
|
|
|
|
|
|
|
| 44 |
|
| 45 |
- Running time Reference: **3h15min** (40k steps)
|
| 46 |
|
| 47 |
-
- Wish you a good time~~~
|
| 48 |
|
| 49 |
## Evaluation Results
|
| 50 |
|
|
|
|
| 36 |
|
| 37 |
|
| 38 |
## Something To Say
|
| 39 |
+
- 😤Reach PLE (0.0.1) through trial and error with SSH Key on (https://github.com/ntasfi/PyGame-Learning-Environment)
|
| 40 |
|
| 41 |
+
- 😭Evaluate 100 turns to get a relatively low score
|
| 42 |
|
| 43 |
+
- 💡PixelCopter is wrapped with ```gymnasium.spaces``` in ```Unit 4_2.py```
|
| 44 |
+
|
| 45 |
+
- 🙂Continue training 20k steps with ```Unit 4_2_continue.py``` after 40k steps in ```Unit 4_2.py```
|
| 46 |
|
| 47 |
- Running time Reference: **3h15min** (40k steps)
|
| 48 |
|
| 49 |
+
- ☀️Wish you a good time~~~
|
| 50 |
|
| 51 |
## Evaluation Results
|
| 52 |
|