kangdawei commited on
Commit
d69570a
·
verified ·
1 Parent(s): 7c7c84d

Training in progress, step 175

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cc7f194e364ffe357e31ee9bd74ed3b837fb94f210fa76a682bac7e01b2b56fd
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e27e62ca010c2e03600a7c998ab55ef79bc4f2eb0d5a3530563386a6fd410dee
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e9a5e1c8379c6893b5725dd63d388dd5edd4d7f84bb4ed4c551bb191debb6e93
3
- size 52400124
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3db690573d7fe991551ce03d13083b3b8cde23b1728f27a0faf397830aab7ff0
3
+ size 63177579
reward_plots/advantage_plot_step_130.png ADDED
reward_plots/advantage_plot_step_140.png ADDED
reward_plots/advantage_plot_step_150.png ADDED
reward_plots/advantage_plot_step_160.png ADDED
reward_plots/advantage_plot_step_170.png ADDED
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:76139b0b38c24dfe680d34aab860a4131d545f67793c82c28262cce68bb49810
3
  size 8440
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:541ad8a10b1e5b2e478a4179066a576ea08c7f01ec5162c968fdd06d4d374cb7
3
  size 8440