kangdawei commited on
Commit
ac290e1
·
verified ·
1 Parent(s): 4539c5e

Training in progress, step 350

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e17a98caa839c1a10fb462aec857a3bd4fb78f019e7c9840d59ceb941f5c2303
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e128f46ed4d7e95715f587097307e28f80acf2f80b88a5c920c35fc8c6147fa2
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7f2a9034042ca29ab389febadd48920223e36deae491f0d1ebb6cf0485dc995b
3
- size 261382273
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e4e951174948e2fc8cd7ba41e58bf03cf7d5a7151cf92f974c84c73b34ffb39f
3
+ size 306523002
reward_plots/advantage_plot_step_300.png ADDED
reward_plots/advantage_plot_step_310.png ADDED
reward_plots/advantage_plot_step_320.png ADDED
reward_plots/advantage_plot_step_330.png ADDED
reward_plots/advantage_plot_step_340.png ADDED
reward_plots/reward_comparison_step_300.png ADDED
reward_plots/reward_comparison_step_310.png ADDED
reward_plots/reward_comparison_step_320.png ADDED
reward_plots/reward_comparison_step_330.png ADDED
reward_plots/reward_comparison_step_340.png ADDED