LinasKo/RL-lander

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Eval Results (legacy)


RL-lander/DQN-1e6/policy.pth


DQN appears to perform the worst when trained for 1e6 timesteps. It did train faster, though, taking about 17 min as opposed to 20-22 min.

3f14e63 over 3 years ago


44 kB

This file is stored with Xet. It is too big to display, but you can still download it.

Xet Pointer Details


Xet hash: 391506ea552cd044f9f6d04584a46fe2b451e7630b49624bb0f527c5ac4e097b
Size of remote file: 44 kB
SHA256: ac8d5f020131e906a9b809bbb9f3cfa00be73acbeb313a2a43759e58927ffad1
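
Since the file cannot be displayed and has to be downloaded, one way to confirm the download is intact is to compare its SHA256 digest with the value listed above. The following is a minimal sketch using only Python's standard library; the local path `policy.pth` and the helper names `sha256_of_file` and `matches_expected` are assumptions for illustration, not part of the repository.

```python
import hashlib
from pathlib import Path

# SHA256 value copied from the pointer details listed above.
EXPECTED_SHA256 = "ac8d5f020131e906a9b809bbb9f3cfa00be73acbeb313a2a43759e58927ffad1"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Check whether the file's digest equals the expected hex string."""
    return sha256_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical location of the downloaded file; adjust as needed.
    local_path = Path("policy.pth")
    if local_path.exists():
        print("match" if matches_expected(local_path) else "MISMATCH")
    else:
        print(f"{local_path} not found; download it first")
```

A mismatch would suggest a truncated or corrupted download, or that the file in the repository has changed since this page was captured.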

Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads.