kangdawei commited on
Commit
ac7eb4d
·
verified ·
1 Parent(s): ed9f395

Training in progress, step 500

Browse files
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8162e14b21dac2bf3c5f3c399b4ff70feb92461736b15c693101a61c5e38c70b
3
  size 335605144
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:50a6906fd10873e34d4c0b960b202b89152ed245737727bd7aaae83f0db953a1
3
  size 335605144
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a79cbc45a892570ebc272573bed08da14826fcc86b960e7d4b4082fecb0916d6
3
- size 24500302
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3dcaec84d7143f8c39e377ff6e2136372653ab8fcb38c70434d08a86424d3d42
3
+ size 26966971
reward_plots/advantage_plot_step_450.png ADDED
reward_plots/advantage_plot_step_460.png ADDED
reward_plots/advantage_plot_step_470.png ADDED
reward_plots/advantage_plot_step_480.png ADDED
reward_plots/advantage_plot_step_490.png ADDED
reward_plots/reward_comparison_step_450.png ADDED
reward_plots/reward_comparison_step_460.png ADDED
reward_plots/reward_comparison_step_470.png ADDED
reward_plots/reward_comparison_step_480.png ADDED
reward_plots/reward_comparison_step_490.png ADDED