kangdawei commited on
Commit
36b6839
·
verified ·
1 Parent(s): 7bb1e7b

Training in progress, step 150

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7b8828d4fbabdc0dad40ddfca9f71f82175d9c17758c4926328ee63d1fbdddc6
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a781e3aef45fa1e809cf060aa100e2170c8f018aed60b767c3ebe4fc56a5d81e
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b8c31e2a2938e67e072cbe07fad82adf22ac153d4a5cce1484eb81423c09ccab
3
- size 85885521
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e2ae14ffe97dec71eb500030c0fa8d5f736c14eaf52ef4f6ff4e495b34e5a734
3
+ size 128037689
reward_plots/advantage_plot_step_100.png ADDED
reward_plots/advantage_plot_step_110.png ADDED
reward_plots/advantage_plot_step_120.png ADDED
reward_plots/advantage_plot_step_130.png ADDED
reward_plots/advantage_plot_step_140.png ADDED
reward_plots/reward_comparison_step_100.png ADDED
reward_plots/reward_comparison_step_110.png ADDED
reward_plots/reward_comparison_step_120.png ADDED
reward_plots/reward_comparison_step_130.png ADDED
reward_plots/reward_comparison_step_140.png ADDED