kangdawei committed on
Commit 00641c3 · verified
1 Parent(s): f97beb1

Training in progress, step 350

model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:48327957b258de6109ed96446f16ad2b0002a03a0f52dedead96cd0efa46c15e
+ oid sha256:e1f5e3110dc0008c5ef78698017b3d7aec90421e23d2c4f6365756e8fdb71d8f
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:72fabd77f8e2b4825cf4d4314f01a9b0be0850d38381804098fa2eb85cdb4ee0
- size 264691050
+ oid sha256:e218cf2fe6f94dd7d66fc174fe5d4c6edb21788e15445fe13ad635325a659d36
+ size 310444766
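Both changed files are Git LFS pointers, so the diff shows only a content hash (`oid`) and a byte count (`size`) rather than the data itself. As a minimal sketch of how such pointers might be compared, the snippet below parses the old and new pointer text for `reward_data/all_rewards.csv` and reports the size change. The helper name `parse_lfs_pointer` is hypothetical and not part of any Git LFS tooling; the pointer values are copied from this commit.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse Git LFS pointer text into a dict (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is a byte count
    return fields


# Pointer contents before and after this commit.
old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:72fabd77f8e2b4825cf4d4314f01a9b0be0850d38381804098fa2eb85cdb4ee0
size 264691050
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:e218cf2fe6f94dd7d66fc174fe5d4c6edb21788e15445fe13ad635325a659d36
size 310444766
""")

content_changed = old["oid"] != new["oid"]
size_delta = new["size"] - old["size"]
print(f"content changed: {content_changed}, size delta: {size_delta} bytes")
```

By the same comparison, `model.safetensors` has a new `oid` but an unchanged `size` of 3554214752 bytes, so its contents changed while its byte count stayed the same.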
reward_plots/advantage_plot_step_300.png ADDED
reward_plots/advantage_plot_step_310.png ADDED
reward_plots/advantage_plot_step_320.png ADDED
reward_plots/advantage_plot_step_330.png ADDED
reward_plots/advantage_plot_step_340.png ADDED
reward_plots/reward_comparison_step_300.png ADDED
reward_plots/reward_comparison_step_310.png ADDED
reward_plots/reward_comparison_step_320.png ADDED
reward_plots/reward_comparison_step_330.png ADDED
reward_plots/reward_comparison_step_340.png ADDED