Commit ec26ece (verified) by kangdawei
1 Parent(s): d706c19

Training in progress, step 200

model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5355ccd0eb312cdcce8e062234d305468b2f159545c407c0fac34c11356b6d48
+oid sha256:931284186cecda154441e056d5923888b700adeaa4f38cffa20313911b19ce55
 size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7282f5ca65e20878085bfab0d0682ac2c44996f6f92db2a67687eac29bc471c8
-size 128977890
+oid sha256:191b69fb53df9b14fc8339e9f0e34930f6aff3004391bb74c4e2dfe3db4c9ccd
+size 172172545
reward_plots/advantage_plot_step_150.png ADDED
reward_plots/advantage_plot_step_160.png ADDED
reward_plots/advantage_plot_step_170.png ADDED
reward_plots/advantage_plot_step_180.png ADDED
reward_plots/advantage_plot_step_190.png ADDED
reward_plots/reward_comparison_step_150.png ADDED
reward_plots/reward_comparison_step_160.png ADDED
reward_plots/reward_comparison_step_170.png ADDED
reward_plots/reward_comparison_step_180.png ADDED
reward_plots/reward_comparison_step_190.png ADDED
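
The two CHANGED entries in this commit are Git LFS pointer files, not the binaries themselves: each records a spec version, a SHA-256 object id, and a byte size. As a minimal sketch of how such a pointer could be read and checked against a downloaded file, the Python below parses the pointer text and compares size and digest. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, introduced only for illustration; they are not part of git-lfs or any Hub client library.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid line has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file has the size and SHA-256 recorded in the pointer."""
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first; skips hashing on a size mismatch
    hasher = hashlib.sha256()
    with path.open("rb") as f:
        # Read in chunks so multi-gigabyte files are never loaded whole.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Old and new pointers for reward_data/all_rewards.csv, copied from this commit.
old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:7282f5ca65e20878085bfab0d0682ac2c44996f6f92db2a67687eac29bc471c8
size 128977890
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:191b69fb53df9b14fc8339e9f0e34930f6aff3004391bb74c4e2dfe3db4c9ccd
size 172172545
""")

growth = new["size"] - old["size"]
print(f"all_rewards.csv grew by {growth} bytes")  # 43194655
```

Read this way, the diff shows `reward_data/all_rewards.csv` growing by 43,194,655 bytes, while `model.safetensors` keeps the same size (3554214752 bytes) and changes only its object id.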