NTQuoc
/

OpenRS-GRPO

Generated from Trainer

Model card Files Files and versions

NTQuoc commited on about 1 month ago

Commit

d64a0c5

·

verified ·

1 Parent(s): e104d31

End of training

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -1,9 +1,11 @@
 ---
 base_model: Qwen/Qwen3.5-0.8B
 library_name: transformers
 model_name: OpenRS-GRPO
 tags:
 - generated_from_trainer
 - trl
 - grpo
 licence: license
@@ -11,7 +13,7 @@ licence: license
 # Model Card for OpenRS-GRPO
-This model is a fine-tuned version of [Qwen/Qwen3.5-0.8B](https://huggingface.co/Qwen/Qwen3.5-0.8B).
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start

 ---
 base_model: Qwen/Qwen3.5-0.8B
+datasets: knoveleng/open-rs
 library_name: transformers
 model_name: OpenRS-GRPO
 tags:
 - generated_from_trainer
+- open-r1
 - trl
 - grpo
 licence: license
 # Model Card for OpenRS-GRPO
+This model is a fine-tuned version of [Qwen/Qwen3.5-0.8B](https://huggingface.co/Qwen/Qwen3.5-0.8B) on the [knoveleng/open-rs](https://huggingface.co/datasets/knoveleng/open-rs) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start