kangdawei commited on
Commit
1ba911a
·
verified ·
1 Parent(s): 7fc43ed

Training in progress, step 400

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:75e163f10c5e4b65bb1e8f051f9650a19bad9c70b4aa270505d20693f9563bb7
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9de95d92260d4d76f4bb3296d7aacbbd54c54b3e6b361e6443f15a9f8b931706
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7a8ab3fc6f54927d43a927475f501f453a319ace903400b2a84157faed0f652b
3
- size 305849475
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f28321fc2fab665504e3ade05cb90f58450fefbbc744d8717cb82cd0a8060a64
3
+ size 347278460
reward_plots/advantage_plot_step_350.png ADDED
reward_plots/advantage_plot_step_360.png ADDED
reward_plots/advantage_plot_step_370.png ADDED
reward_plots/advantage_plot_step_380.png ADDED
reward_plots/advantage_plot_step_390.png ADDED
reward_plots/reward_comparison_step_350.png ADDED
reward_plots/reward_comparison_step_360.png ADDED
reward_plots/reward_comparison_step_370.png ADDED
reward_plots/reward_comparison_step_380.png ADDED
reward_plots/reward_comparison_step_390.png ADDED