goyalayus committed on
Commit 7db6f4d · verified · 1 Parent(s): 5ae51f5

Training in progress, step 3

Files changed (3)
  1. README.md +1 -1
  2. adapter_model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -28,7 +28,7 @@ print(output["generated_text"])
 
 ## Training procedure
 
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/goyalayuss-iit-roorkee/wordle-research/runs/wordle_clean_smoke_20260321_222858-rl-mixed_rl-b1f02dc0b8)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/goyalayuss-iit-roorkee/wordle-research/runs/wordle_full_validation_20260322-164825-rl-mixed_rl-f10fa460e8)
 
 
 This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:25d887cc3465a171b3e4c050d48e646dc5d0aa3c84e8a913e8baa0de15b3c0af
+oid sha256:da11663eb9da0fa82dea54ea374c872e86182cac939b3f1c8fa60726bcbd5c0b
 size 84962944
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:97b9ebd429a3452d4eb8a90127debf802b054906308e9a4a5feec63ca871d3ad
+oid sha256:4f0b4d57ded92f95aa5e116f68cbf4d3df92e07b262260a704968f80f016f3a7
 size 7697