Training in progress, step 3

Files changed (3)

README.md CHANGED

@@ -28,7 +28,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/goyalayuss-iit-roorkee/wordle-research/runs/wordle_clean_smoke_20260321_222858-rl-mixed_rl-b1f02dc0b8)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/goyalayuss-iit-roorkee/wordle-research/runs/wordle_full_validation_20260322-164825-rl-mixed_rl-f10fa460e8)
 This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:25d887cc3465a171b3e4c050d48e646dc5d0aa3c84e8a913e8baa0de15b3c0af
+oid sha256:da11663eb9da0fa82dea54ea374c872e86182cac939b3f1c8fa60726bcbd5c0b
 size 84962944

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:97b9ebd429a3452d4eb8a90127debf802b054906308e9a4a5feec63ca871d3ad
+oid sha256:4f0b4d57ded92f95aa5e116f68cbf4d3df92e07b262260a704968f80f016f3a7
 size 7697
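
Note: both binary files are stored as Git LFS pointers, so the diff shows only the `oid` changing while `size` stays the same. A minimal sketch of how a downloaded file could be checked against its pointer, assuming the pointer text has been saved separately; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of this repository:

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest in `pointer`."""
    file_path = Path(path)
    # Cheap check first: a size mismatch means the digest cannot match.
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Hash in chunks so large adapter weights are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, `matches_pointer("training_args.bin", parse_lfs_pointer(pointer_text))` would confirm that a local copy corresponds to the new `oid` rather than the previous one.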