Upload SDPO-train32-alpha0.5-rollout8-lr1e-5-bigmath-Qwen-Qwen3-1.7B/latest_checkpointed_iteration.txt with huggingface_hub
Browse files
SDPO-train32-alpha0.5-rollout8-lr1e-5-bigmath-Qwen-Qwen3-1.7B/latest_checkpointed_iteration.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
80
|