kangdawei commited on
Commit
77d5590
·
verified ·
1 Parent(s): 74fe1a9

Training in progress, step 500

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:463c934d4d895615a228a4c60306a4962ba9ca3c547ad066596af8158e58d4c8
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:969169500044001ff85ebc652cf0bebebe5835e493cf08f34cce4372a012c67e
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d1de6a3bb3d029992ea4c7c8f720052c644ee1271d7db173b43d84df1920c2fa
3
- size 360978041
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5d9dd22abf4b78720a54d852caa5e5549c91c73ed6543f51f6dd684920dad078
3
+ size 363764427
reward_plots/advantage_plot_step_450.png ADDED
reward_plots/advantage_plot_step_460.png ADDED
reward_plots/advantage_plot_step_470.png ADDED
reward_plots/advantage_plot_step_480.png ADDED
reward_plots/advantage_plot_step_490.png ADDED
reward_plots/reward_comparison_step_450.png ADDED
reward_plots/reward_comparison_step_460.png ADDED
reward_plots/reward_comparison_step_470.png ADDED
reward_plots/reward_comparison_step_480.png ADDED
reward_plots/reward_comparison_step_490.png ADDED