kangdawei committed on
Commit 1af649f · verified · 1 Parent(s): 970e43f

Training in progress, step 200

model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:0f1087d4f178d34928d1915582eb66a2b60c6ee97cd003dc5f3e84b39d0975f8
+ oid sha256:8029401ea283be9fa3d1f5b7aea98d13aca40254349b8f5b58c6830f3099a6fd
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fcc337f376062712e6865c467ee888457e8eb631b9224dbc74deb7db0efe8f71
- size 121408880
+ oid sha256:55eb6539dd600ef46b2e88d1dd4a26c60f05a245ce3387b019efa43529ea6309
+ size 153894105
reward_plots/advantage_plot_step_150.png ADDED
reward_plots/advantage_plot_step_160.png ADDED
reward_plots/advantage_plot_step_170.png ADDED
reward_plots/advantage_plot_step_180.png ADDED
reward_plots/advantage_plot_step_190.png ADDED
reward_plots/reward_comparison_step_150.png ADDED
reward_plots/reward_comparison_step_160.png ADDED
reward_plots/reward_comparison_step_170.png ADDED
reward_plots/reward_comparison_step_180.png ADDED
reward_plots/reward_comparison_step_190.png ADDED
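
The changed files in this commit are Git LFS pointers, each recording a spec version, a SHA-256 oid, and a size in bytes. A minimal sketch of reading such a pointer and checking a downloaded file against it might look like the following. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any tool used in this repository; the pointer values are copied from the reward_data/all_rewards.csv diff in this commit.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse Git LFS pointer text into version, hash algorithm, oid and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by a single space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file has the size and SHA-256 digest in the pointer."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Hash in chunks so multi-gigabyte files are not loaded into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == pointer["oid"]


# Pointer contents before and after this commit (reward_data/all_rewards.csv).
OLD_REWARDS = """version https://git-lfs.github.com/spec/v1
oid sha256:fcc337f376062712e6865c467ee888457e8eb631b9224dbc74deb7db0efe8f71
size 121408880
"""
NEW_REWARDS = """version https://git-lfs.github.com/spec/v1
oid sha256:55eb6539dd600ef46b2e88d1dd4a26c60f05a245ce3387b019efa43529ea6309
size 153894105
"""

old = parse_lfs_pointer(OLD_REWARDS)
new = parse_lfs_pointer(NEW_REWARDS)
size_growth = new["size"] - old["size"]  # bytes added to the CSV by this commit
```

Once the real file has been fetched, `matches_pointer("reward_data/all_rewards.csv", new)` would confirm that the local copy corresponds to this commit. The model.safetensors pointer changes only its oid, so the same subtraction on its two pointers gives zero.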