Commit 975c940 (verified), committed by kangdawei
1 Parent(s): cc32a2b

Training in progress, step 500

model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:056c7d8f5a80079683798dbd969d36f77ec2ed3a3adc3dc1db5f896b29d65223
+oid sha256:0bfb931f2500248ed1fa2d7f6c3773beada9807b73a9e8710e8d5953bf2038e3
 size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:42f3b07fa012398cc829a0f7e28187df090d76c3886b424c1eaa13d3388616f2
-size 305876796
+oid sha256:dac90dbef13d9e91016e6a79569594905691f7fbc229b4cc9e34d93ce5c41da5
+size 339790878
reward_plots/advantage_plot_step_450.png ADDED
reward_plots/advantage_plot_step_460.png ADDED
reward_plots/advantage_plot_step_470.png ADDED
reward_plots/advantage_plot_step_480.png ADDED
reward_plots/advantage_plot_step_490.png ADDED
reward_plots/reward_comparison_step_450.png ADDED
reward_plots/reward_comparison_step_460.png ADDED
reward_plots/reward_comparison_step_470.png ADDED
reward_plots/reward_comparison_step_480.png ADDED
reward_plots/reward_comparison_step_490.png ADDED