Thrillcrazyer committed
Commit 35a7376 · verified · 1 parent: 8e17b3d

Training in progress, step 100

README.md CHANGED
@@ -27,7 +27,7 @@ print(output["generated_text"])
 
  ## Training procedure
 
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/pthpark1/THIP_COMPARE_QWEN7/runs/d1zx0kgj)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/pthpark1/THIP_COMPARE_QWEN7/runs/lkg20u9d)
 
 
  This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
model-00001-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:c0ecae4ad9751ccc489a7d865c48a77b69e40ee4abe245e70c998d5673a029e3
+ oid sha256:4ae0fe3582623804fe214fb554492322c139185e0c3de7b1bf32e8da83459fb8
  size 4877660776
model-00002-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:916fd26a2be7b30e7190895a0812b5d1f030c13311cd4f7ad855e0af945c12c7
+ oid sha256:853e8e6623a1b76099c6004902f09e973c52df29fb36b1f3e8f23e565291e2a0
  size 4932751008
model-00003-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:3762fcd10383b02926d063b06bf0a16adf3973f3114a67edba7632569a1298d1
+ oid sha256:55821d5ec125e861e8526c1df0c053ce3a222e61adbe8bf4007e7b51937aff59
  size 4330865200
model-00004-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:11d20d002e4f12d260e3686e4c85c09d69c4373ec6aecd1f5213de21d221d9f4
+ oid sha256:5267cdc4dd96056cde9a596b636a3e932edfa5b34aede0469b0416bfcf0be7a4
  size 1089994880
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:9027d84f36e545e6abba7c5be551a7d0c84c6b6daaff480dafc3118ff658716b
+ oid sha256:df19fd498849fc116f86d81d9368bfedb0955e647cd072d05f2306b9af1a13d2
  size 8657
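
Note: the binary files in this commit are tracked as Git LFS pointers, so each diff changes only the `oid sha256:` line while `size` stays the same. The following is a minimal sketch, not part of the commit, of how a downloaded file could be checked against such a pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the sketch assumes the pointer has exactly the three `version` / `oid` / `size` lines shown in the diffs.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid line has the form '<algorithm>:<hex digest>', e.g. 'sha256:df19...'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the pointer's size and digest."""
    path = Path(path)
    # Cheap check first: a size mismatch means the hash cannot match.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    # Read in chunks so multi-gigabyte shards are not loaded into memory at once.
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, parsing the new `training_args.bin` pointer text and passing the result to `matches_pointer` together with the local path of the downloaded file would indicate whether the local copy corresponds to this commit.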