kangdawei committed on
Commit 00a1151 · verified · 1 Parent(s): 05eca2a

Training in progress, step 150

model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:15e6e5f038eeddaeea0d7def151a163439301398b0d72ec5095abce61f3935b7
+oid sha256:a22a5c7b27dfc7df8e04b7ed47e50930ccf86a4dc4dd3289dd414f337836476b
 size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:622947214d474e6898133289a8f93e73d2790fb6a73528579dd08c792941060a
-size 84147138
+oid sha256:c191935150ca8cbbd5b6be030e2fa30c3185246fbde615cf6943f9e2a6ec7968
+size 125392538
reward_plots/advantage_plot_step_100.png ADDED
reward_plots/advantage_plot_step_110.png ADDED
reward_plots/advantage_plot_step_120.png ADDED
reward_plots/advantage_plot_step_130.png ADDED
reward_plots/advantage_plot_step_140.png ADDED
reward_plots/reward_comparison_step_100.png ADDED
reward_plots/reward_comparison_step_110.png ADDED
reward_plots/reward_comparison_step_120.png ADDED
reward_plots/reward_comparison_step_130.png ADDED
reward_plots/reward_comparison_step_140.png ADDED