kangdawei committed on
Commit d706c19 · verified · 1 Parent(s): 5f94503

Training in progress, step 150

model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:af892fb5d3e1bf51b7cb8901804fe649e8230f60c8dee6850a3b5880c72f991a
+ oid sha256:5355ccd0eb312cdcce8e062234d305468b2f159545c407c0fac34c11356b6d48
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:01c3fce594840f394b984b2654efd23adc490d8e86ade02641f0e6e76f21ed7a
- size 85996181
+ oid sha256:7282f5ca65e20878085bfab0d0682ac2c44996f6f92db2a67687eac29bc471c8
+ size 128977890
reward_plots/advantage_plot_step_100.png ADDED
reward_plots/advantage_plot_step_110.png ADDED
reward_plots/advantage_plot_step_120.png ADDED
reward_plots/advantage_plot_step_130.png ADDED
reward_plots/advantage_plot_step_140.png ADDED
reward_plots/reward_comparison_step_100.png ADDED
reward_plots/reward_comparison_step_110.png ADDED
reward_plots/reward_comparison_step_120.png ADDED
reward_plots/reward_comparison_step_130.png ADDED
reward_plots/reward_comparison_step_140.png ADDED