Update README.md
Browse files
README.md
CHANGED
|
@@ -37,4 +37,4 @@ Second, it uses Experience Replay.
|
|
| 37 |
|
| 38 |
We store list of tuples (state, action, reward, next_state), and instead of learning only from recent experience, we learn from sampling all of our experience accumulated so far.
|
| 39 |
|
| 40 |
-
[pendulum_gif](https://imgur.com/eEH8Cz6)
|
|
|
|
| 37 |
|
| 38 |
We store list of tuples (state, action, reward, next_state), and instead of learning only from recent experience, we learn from sampling all of our experience accumulated so far.
|
| 39 |
|
| 40 |
+

|