Training in progress, step 100

Files changed (6) hide show

README.md CHANGED Viewed

@@ -27,7 +27,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/pthpark1/THIP_COMPARE_QWEN7/runs/d1zx0kgj)
 This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).

 ## Training procedure
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/pthpark1/THIP_COMPARE_QWEN7/runs/lkg20u9d)
 This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).

model-00001-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c0ecae4ad9751ccc489a7d865c48a77b69e40ee4abe245e70c998d5673a029e3
 size 4877660776

 version https://git-lfs.github.com/spec/v1
+oid sha256:4ae0fe3582623804fe214fb554492322c139185e0c3de7b1bf32e8da83459fb8
 size 4877660776

model-00002-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:916fd26a2be7b30e7190895a0812b5d1f030c13311cd4f7ad855e0af945c12c7
 size 4932751008

 version https://git-lfs.github.com/spec/v1
+oid sha256:853e8e6623a1b76099c6004902f09e973c52df29fb36b1f3e8f23e565291e2a0
 size 4932751008

model-00003-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3762fcd10383b02926d063b06bf0a16adf3973f3114a67edba7632569a1298d1
 size 4330865200

 version https://git-lfs.github.com/spec/v1
+oid sha256:55821d5ec125e861e8526c1df0c053ce3a222e61adbe8bf4007e7b51937aff59
 size 4330865200

model-00004-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:11d20d002e4f12d260e3686e4c85c09d69c4373ec6aecd1f5213de21d221d9f4
 size 1089994880

 version https://git-lfs.github.com/spec/v1
+oid sha256:5267cdc4dd96056cde9a596b636a3e932edfa5b34aede0469b0416bfcf0be7a4
 size 1089994880

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9027d84f36e545e6abba7c5be551a7d0c84c6b6daaff480dafc3118ff658716b
 size 8657

 version https://git-lfs.github.com/spec/v1
+oid sha256:df19fd498849fc116f86d81d9368bfedb0955e647cd072d05f2306b9af1a13d2
 size 8657