kangdawei commited on
Commit
55497ac
·
verified ·
1 Parent(s): 00a1151

Training in progress, step 200

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a22a5c7b27dfc7df8e04b7ed47e50930ccf86a4dc4dd3289dd414f337836476b
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:18aafb649812e9813bb3b36b569d80b40f236ba6c523e36e2a1f1a59b260c913
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c191935150ca8cbbd5b6be030e2fa30c3185246fbde615cf6943f9e2a6ec7968
3
- size 125392538
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4c17f16bb3da68c0bdc143c41bd785b68e5e8dc1497dcf155b27ddf80f661755
3
+ size 162284378
reward_plots/advantage_plot_step_150.png ADDED
reward_plots/advantage_plot_step_160.png ADDED
reward_plots/advantage_plot_step_170.png ADDED
reward_plots/advantage_plot_step_180.png ADDED
reward_plots/advantage_plot_step_190.png ADDED
reward_plots/reward_comparison_step_150.png ADDED
reward_plots/reward_comparison_step_160.png ADDED
reward_plots/reward_comparison_step_170.png ADDED
reward_plots/reward_comparison_step_180.png ADDED
reward_plots/reward_comparison_step_190.png ADDED