kangdawei committed on
Commit 0d8f36a · verified · 1 Parent(s): 55497ac

Training in progress, step 250

model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:18aafb649812e9813bb3b36b569d80b40f236ba6c523e36e2a1f1a59b260c913
+ oid sha256:764f6317dd2f914c027e8dbce2e76b459a862a5b5a08fbe888afa2cf754caae4
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:4c17f16bb3da68c0bdc143c41bd785b68e5e8dc1497dcf155b27ddf80f661755
- size 162284378
+ oid sha256:67a5f0f0592b9d2001d3d2da9fe0ed74c0e35f44022c306351f275659f07b16a
+ size 190361816
reward_plots/advantage_plot_step_200.png ADDED
reward_plots/advantage_plot_step_210.png ADDED
reward_plots/advantage_plot_step_220.png ADDED
reward_plots/advantage_plot_step_230.png ADDED
reward_plots/advantage_plot_step_240.png ADDED
reward_plots/reward_comparison_step_200.png ADDED
reward_plots/reward_comparison_step_210.png ADDED
reward_plots/reward_comparison_step_220.png ADDED
reward_plots/reward_comparison_step_230.png ADDED
reward_plots/reward_comparison_step_240.png ADDED
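
The two CHANGED entries above are Git LFS pointer files, so the diff only shows a new `oid` (a SHA-256 digest) and `size` rather than the file contents. A minimal sketch of how a downloaded file could be checked against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of this repository or of any Git LFS tooling; the sketch assumes the pointer uses the three `version` / `oid` / `size` keys shown in the diff.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<algorithm>:<hex digest>", e.g. "sha256:764f...".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest named by the pointer."""
    p = Path(path)
    # Cheap check first: a size mismatch means the digest cannot match either.
    if p.stat().st_size != pointer["size"]:
        return False
    h = hashlib.new(pointer["algo"])
    with p.open("rb") as f:
        # Hash in 1 MiB chunks so multi-gigabyte files are not loaded into memory.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    # Pointer contents copied from the new side of the model.safetensors hunk.
    pointer = parse_lfs_pointer(
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:764f6317dd2f914c027e8dbce2e76b459a862a5b5a08fbe888afa2cf754caae4\n"
        "size 3554214752\n"
    )
    print(pointer["algo"], pointer["size"])
```

With a local copy of the file, `matches_pointer("model.safetensors", pointer)` would report whether it corresponds to this commit; the local path here is an assumption about where the file was saved.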