jin-kwon
/

pakdd_tag_comcts

Generated from Trainer

Model card Files Files and versions

jin-kwon commited on Nov 9, 2025

Commit

9596b39

·

verified ·

1 Parent(s): ac0edd2

Training in progress, step 50

Files changed (2) hide show

README.md +2 -2
adapter_model.safetensors +1 -1

README.md CHANGED Viewed

@@ -3,8 +3,8 @@ library_name: transformers
 model_name: pakdd_tag_comcts
 tags:
 - generated_from_trainer
-- grpo
 - trl
 licence: license
 ---
@@ -26,7 +26,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/jylee18/tag/runs/vwdyhn31)
 This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).

 model_name: pakdd_tag_comcts
 tags:
 - generated_from_trainer
 - trl
+- grpo
 licence: license
 ---
 ## Training procedure
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/jylee18/tag/runs/ri6qhezt)
 This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:73ebb3cb1f6aeb68bea55eb6b238122eaaa04ac5ee8b98bfa2e6279640686cbf
 size 11817664

 version https://git-lfs.github.com/spec/v1
+oid sha256:1d6d0f4c532f4b2cb7bf72598f292ed7ca1cc217f8d21c78039113d65929a1b4
 size 11817664