kangdawei commited on
Commit
d73bc1a
·
verified ·
1 Parent(s): 182274c

Training in progress, step 500

Browse files
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4b7a99b11581bab590d0ace5a6f477c811b97c6a3977edb275fb2edde358c86c
3
  size 323014560
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4af89ba14c8d8e3d72cf3339d9bc6d028a8f792977913eed331631eb2217f5ff
3
  size 323014560
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:37880f91db94ed6c2e528c86ecc56117df2f6d8b037716625b663a769d48e225
3
- size 21279718
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b3a50294d239f71aacc00241db9f4b353de52cd54cb7c97c31783b3e7fe302f3
3
+ size 23378556
reward_plots/advantage_plot_step_450.png ADDED
reward_plots/advantage_plot_step_460.png ADDED
reward_plots/advantage_plot_step_470.png ADDED
reward_plots/advantage_plot_step_480.png ADDED
reward_plots/advantage_plot_step_490.png ADDED
reward_plots/reward_comparison_step_450.png ADDED
reward_plots/reward_comparison_step_460.png ADDED
reward_plots/reward_comparison_step_470.png ADDED
reward_plots/reward_comparison_step_480.png ADDED
reward_plots/reward_comparison_step_490.png ADDED