RL-lander / DQN-1e6 /policy.pth
LinasKo's picture
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
3f14e63
This file is stored with Xet . It is too big to display, but you can still download it.

Xet Pointer Details

( Raw pointer file )
Xet hash:
391506ea552cd044f9f6d04584a46fe2b451e7630b49624bb0f527c5ac4e097b
Size of remote file:
44 kB
·
SHA256:
ac8d5f020131e906a9b809bbb9f3cfa00be73acbeb313a2a43759e58927ffad1

Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.