kangdawei commited on
Commit
f97beb1
·
verified ·
1 Parent(s): 9e4d606

Training in progress, step 300

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5ecc59a9752d7255caf05550ed7acc617ee8582df4268998b402c88b71c37a34
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:48327957b258de6109ed96446f16ad2b0002a03a0f52dedead96cd0efa46c15e
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:598d273d918f1ec949e1a0a72ea9680ea9343d65ab338c44052ce6bff5419d85
3
- size 216539631
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:72fabd77f8e2b4825cf4d4314f01a9b0be0850d38381804098fa2eb85cdb4ee0
3
+ size 264691050
reward_plots/advantage_plot_step_250.png ADDED
reward_plots/advantage_plot_step_260.png ADDED
reward_plots/advantage_plot_step_270.png ADDED
reward_plots/advantage_plot_step_280.png ADDED
reward_plots/advantage_plot_step_290.png ADDED
reward_plots/reward_comparison_step_250.png ADDED
reward_plots/reward_comparison_step_260.png ADDED
reward_plots/reward_comparison_step_270.png ADDED
reward_plots/reward_comparison_step_280.png ADDED
reward_plots/reward_comparison_step_290.png ADDED