kangdawei commited on
Commit
4b7cf2d
·
verified ·
1 Parent(s): a2b001e

Training in progress, step 500

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:683290c132aa9806f89e81273f3326bd44180722550852804f35dda6a79a9b2e
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b5463610f8d75d4c52613a782e6e9f132e126a632103e199333385fb6a55e595
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fb9d4a5f0601bd3b2eb5d5cb76928d91d35bd698255372126f3928157ca554b4
3
- size 293098458
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d2af2fa1c7f249471a1a4f732ee6d5efdcf28897b3fa6b15eca0be9df546a80f
3
+ size 318451553
reward_plots/advantage_plot_step_450.png ADDED
reward_plots/advantage_plot_step_460.png ADDED
reward_plots/advantage_plot_step_470.png ADDED
reward_plots/advantage_plot_step_480.png ADDED
reward_plots/advantage_plot_step_490.png ADDED
reward_plots/reward_comparison_step_450.png ADDED
reward_plots/reward_comparison_step_460.png ADDED
reward_plots/reward_comparison_step_470.png ADDED
reward_plots/reward_comparison_step_480.png ADDED
reward_plots/reward_comparison_step_490.png ADDED