kangdawei committed on
Commit cc32a2b · verified · 1 Parent(s): b3603e2

Training in progress, step 450

model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2efa942c7dd2572122d8114b01b07b0a465084702f40c9b4a647f20a5f78b337
+ oid sha256:056c7d8f5a80079683798dbd969d36f77ec2ed3a3adc3dc1db5f896b29d65223
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d0fffbb7b0488861377b0820bcf46747e00996bfe212d90959c0781dec358e90
- size 272830726
+ oid sha256:42f3b07fa012398cc829a0f7e28187df090d76c3886b424c1eaa13d3388616f2
+ size 305876796
reward_plots/advantage_plot_step_400.png ADDED
reward_plots/advantage_plot_step_410.png ADDED
reward_plots/advantage_plot_step_420.png ADDED
reward_plots/advantage_plot_step_430.png ADDED
reward_plots/advantage_plot_step_440.png ADDED
reward_plots/reward_comparison_step_400.png ADDED
reward_plots/reward_comparison_step_410.png ADDED
reward_plots/reward_comparison_step_420.png ADDED
reward_plots/reward_comparison_step_430.png ADDED
reward_plots/reward_comparison_step_440.png ADDED