Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -37,4 +37,4 @@ Second, it uses Experience Replay.
 We store list of tuples (state, action, reward, next_state), and instead of learning only from recent experience, we learn from sampling all of our experience accumulated so far.
-[pendulum_gif](https://imgur.com/eEH8Cz6)


37
38	We store list of tuples (state, action, reward, next_state), and instead of learning only from recent experience, we learn from sampling all of our experience accumulated so far.
39
40	+ ![pendulum_gif](https://imgur.com/eEH8Cz6)