YangZhoumill/Qwen2.5-1.5B-Open-R1-Distill

Text Generation

Generated from Trainer

text-generation-inference


YangZhoumill committed on Mar 24, 2025

Commit 08b8982 · verified · 1 Parent(s): 3d86a5d

End of training

Files changed (2)

README.md +5 -3
training_args.bin +1 -1

README.md CHANGED

@@ -1,17 +1,19 @@
 ---
 base_model: Qwen/Qwen2.5-0.5B-Instruct
 library_name: transformers
-model_name: Qwen2.5-1.5B-Open-R1-Distill
 tags:
 - generated_from_trainer
 - trl
 - sft
 licence: license
 ---
-# Model Card for Qwen2.5-1.5B-Open-R1-Distill
-This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start

 ---
 base_model: Qwen/Qwen2.5-0.5B-Instruct
+datasets: YangZhoumill/post_v_1
 library_name: transformers
+model_name: Qwen2.5-1.5B-Open-R1-Distill-3240658
 tags:
 - generated_from_trainer
+- open-r1
 - trl
 - sft
 licence: license
 ---
+# Model Card for Qwen2.5-1.5B-Open-R1-Distill-3240658
+This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) on the [YangZhoumill/post_v_1](https://huggingface.co/datasets/YangZhoumill/post_v_1) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cd537ffcc6c893119ccf93412459c7d08d780fad51132960e1a4e2057f8c0085
 size 6072

 version https://git-lfs.github.com/spec/v1
+oid sha256:6e31f77132d9a2ffedf8d30d23fe440d59e1644597d3ad6dde22265d232c7233
 size 6072
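
The training_args.bin entries above are Git LFS pointer files: three `key value` lines giving the spec version, the SHA-256 of the real file, and its size in bytes. Only the `oid` changed in this commit; the size stayed at 6072. A minimal sketch of how a downloaded file could be checked against such a pointer is shown below. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, introduced only for illustration, and are not part of Git LFS or the Hub tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by a single space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid is stored as "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(pointer, data):
    """Return True if `data` has the size and SHA-256 digest recorded in the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError("this sketch only handles sha256 oids")
    same_size = len(data) == pointer["size"]
    same_hash = hashlib.sha256(data).hexdigest() == pointer["digest"]
    return same_size and same_hash


# Pointer text for the new training_args.bin, copied from the diff above.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:6e31f77132d9a2ffedf8d30d23fe440d59e1644597d3ad6dde22265d232c7233
size 6072
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

To check a local copy, read it in binary mode and pass the bytes to `matches_pointer(pointer, data)`; a `False` result means the file is not the one this commit points to.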