kangdawei commited on
Commit
2b43231
·
verified ·
1 Parent(s): b680f9a

Training in progress, step 200

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:165a4933ea8117196d21c518385375e7abba43cb5243e756a792761f63635278
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:26912a8c589245f7b9d26c16a78b86201c491af8c377f916486b3b48cab0ab0e
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ec5a5fa1afa037c8a43b4c8eedeb3834816700fc555ee72a7eaa67c1cc854975
3
- size 128751357
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:93c71118bfd916d7c335741067d24bb812ca676c243e5afadd926e466d89e980
3
+ size 172055996
reward_plots/advantage_plot_step_150.png ADDED
reward_plots/advantage_plot_step_160.png ADDED
reward_plots/advantage_plot_step_170.png ADDED
reward_plots/advantage_plot_step_180.png ADDED
reward_plots/advantage_plot_step_190.png ADDED
reward_plots/reward_comparison_step_150.png ADDED
reward_plots/reward_comparison_step_160.png ADDED
reward_plots/reward_comparison_step_170.png ADDED
reward_plots/reward_comparison_step_180.png ADDED
reward_plots/reward_comparison_step_190.png ADDED