YangZhoumill commited on
Commit
08b8982
·
verified ·
1 Parent(s): 3d86a5d

End of training

Browse files
Files changed (2) hide show
  1. README.md +5 -3
  2. training_args.bin +1 -1
README.md CHANGED
@@ -1,17 +1,19 @@
1
  ---
2
  base_model: Qwen/Qwen2.5-0.5B-Instruct
 
3
  library_name: transformers
4
- model_name: Qwen2.5-1.5B-Open-R1-Distill
5
  tags:
6
  - generated_from_trainer
 
7
  - trl
8
  - sft
9
  licence: license
10
  ---
11
 
12
- # Model Card for Qwen2.5-1.5B-Open-R1-Distill
13
 
14
- This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).
15
  It has been trained using [TRL](https://github.com/huggingface/trl).
16
 
17
  ## Quick start
 
1
  ---
2
  base_model: Qwen/Qwen2.5-0.5B-Instruct
3
+ datasets: YangZhoumill/post_v_1
4
  library_name: transformers
5
+ model_name: Qwen2.5-1.5B-Open-R1-Distill-3240658
6
  tags:
7
  - generated_from_trainer
8
+ - open-r1
9
  - trl
10
  - sft
11
  licence: license
12
  ---
13
 
14
+ # Model Card for Qwen2.5-1.5B-Open-R1-Distill-3240658
15
 
16
+ This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) on the [YangZhoumill/post_v_1](https://huggingface.co/datasets/YangZhoumill/post_v_1) dataset.
17
  It has been trained using [TRL](https://github.com/huggingface/trl).
18
 
19
  ## Quick start
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cd537ffcc6c893119ccf93412459c7d08d780fad51132960e1a4e2057f8c0085
3
  size 6072
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6e31f77132d9a2ffedf8d30d23fe440d59e1644597d3ad6dde22265d232c7233
3
  size 6072