ImaghT
/

reinforce-Pixelcopter-PLE-v0

Reinforcement Learning

Pixelcopter-PLE-v0

deep-reinforcement-learning

policy-gradient

Eval Results (legacy)

Model card Files Files and versions

ImaghT commited on 27 days ago

Commit

7abc809

·

verified ·

1 Parent(s): bd18536

Update README.md

Files changed (1) hide show

README.md +6 -4

README.md CHANGED Viewed

@@ -36,15 +36,17 @@ REINFORCE is a policy gradient method that:
 ## Something To Say
-- Reach PLE (0.0.1) through trial and error with SSH Key on (https://github.com/ntasfi/PyGame-Learning-Environment)
-- PixelCopter needs to be wrapped with ```gymnasium.spaces``` in ```Unit 4_2.py```
-- Continue training 20k steps with ```Unit 4_2_continue.py``` after 40k steps in ```Unit 4_2.py```
 - Running time Reference: **3h15min** (40k steps)
-- Wish you a good time~~~
 ## Evaluation Results

 ## Something To Say
+- 😤Reach PLE (0.0.1) through trial and error with SSH Key on (https://github.com/ntasfi/PyGame-Learning-Environment)
+- 😭Evaluate 100 turns to get a relatively low score
+- 💡PixelCopter is wrapped with ```gymnasium.spaces``` in ```Unit 4_2.py```
+- 🙂Continue training 20k steps with ```Unit 4_2_continue.py``` after 40k steps in ```Unit 4_2.py```
 - Running time Reference: **3h15min** (40k steps)
+- ☀️Wish you a good time~~~
 ## Evaluation Results