roaminwind
/

Qwen2.5-1.5B-Open-R1-Distill-2kdata

Generated from Trainer

Model card Files Files and versions

Update README.md

#2

by roaminwind - opened Apr 14, 2025

base: refs/heads/main

←

from: refs/pr/2

Discussion Files changed

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -11,7 +11,7 @@ tags:
 licence: license
 ---
-# Model Card for Qwen2.5-1.5B-Open-R1-Distill
 This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) on the [open-r1/OpenR1-Math-220k](https://huggingface.co/datasets/open-r1/OpenR1-Math-220k) dataset, only 2k of that used.
 It has been trained using [TRL](https://github.com/huggingface/trl).

 licence: license
 ---
+# Model Card for Qwen2.5-1.5B-Open-R1-Distill-2kdata
 This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) on the [open-r1/OpenR1-Math-220k](https://huggingface.co/datasets/open-r1/OpenR1-Math-220k) dataset, only 2k of that used.
 It has been trained using [TRL](https://github.com/huggingface/trl).