kangdawei commited on
Commit
3e03734
·
verified ·
1 Parent(s): a77d33f

Training in progress, step 450

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a2676115b5d2c4721dffdbd70daa9a4b12a00bbc27ed8bdb04dfee789cd47449
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f702f57859188679c4525beb909f9898a8f6cee927fa0e1da6cc7364eeaa4937
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1f91b4bb4a12106dc63c03e01c9c5fc1a52a8e34487eb374d2929c2516dc3b74
3
- size 247098682
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:802417baa349cee586941c622de92fefd916a4847e149b22e90fc93554499308
3
+ size 271135598
reward_plots/advantage_plot_step_400.png ADDED
reward_plots/advantage_plot_step_410.png ADDED
reward_plots/advantage_plot_step_420.png ADDED
reward_plots/advantage_plot_step_430.png ADDED
reward_plots/advantage_plot_step_440.png ADDED
reward_plots/reward_comparison_step_400.png ADDED
reward_plots/reward_comparison_step_410.png ADDED
reward_plots/reward_comparison_step_420.png ADDED
reward_plots/reward_comparison_step_430.png ADDED
reward_plots/reward_comparison_step_440.png ADDED