leonMW commited on
Commit
d61594a
·
verified ·
1 Parent(s): 5e54e19

Training in progress, epoch 1

Browse files
Files changed (3) hide show
  1. README.md +3 -5
  2. model.safetensors +1 -1
  3. training_args.bin +2 -2
README.md CHANGED
@@ -1,19 +1,17 @@
1
  ---
2
  base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
3
- datasets: AIML-TUDA/SLR-Bench
4
  library_name: transformers
5
  model_name: DeepSeek-R1-Distill-Qwen-1.5B-SFT-Easy
6
  tags:
7
  - generated_from_trainer
8
- - trl
9
- - open-r1
10
  - sft
 
11
  licence: license
12
  ---
13
 
14
  # Model Card for DeepSeek-R1-Distill-Qwen-1.5B-SFT-Easy
15
 
16
- This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B) on the [AIML-TUDA/SLR-Bench](https://huggingface.co/datasets/AIML-TUDA/SLR-Bench) dataset.
17
  It has been trained using [TRL](https://github.com/huggingface/trl).
18
 
19
  ## Quick start
@@ -29,7 +27,7 @@ print(output["generated_text"])
29
 
30
  ## Training procedure
31
 
32
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/leonwenderoth-tu-darmstadt/huggingface/runs/8u8833i1)
33
 
34
 
35
  This model was trained with SFT.
 
1
  ---
2
  base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
 
3
  library_name: transformers
4
  model_name: DeepSeek-R1-Distill-Qwen-1.5B-SFT-Easy
5
  tags:
6
  - generated_from_trainer
 
 
7
  - sft
8
+ - trl
9
  licence: license
10
  ---
11
 
12
  # Model Card for DeepSeek-R1-Distill-Qwen-1.5B-SFT-Easy
13
 
14
+ This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B).
15
  It has been trained using [TRL](https://github.com/huggingface/trl).
16
 
17
  ## Quick start
 
27
 
28
  ## Training procedure
29
 
30
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/leonwenderoth-tu-darmstadt/huggingface/runs/srr77x46)
31
 
32
 
33
  This model was trained with SFT.
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1790eca7ae0bcbf7525a0d0d255c3deabf863de203131692f6988f3c9c6c1b83
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:78e8b8f48462e40aadeb7791b19e019b29bd7431927f37107f94c5720da14b36
3
  size 3554214752
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:22864e98863eee03708d47dcd85f80c5980bc42fc41122db33825b2e36a632c8
3
- size 7889
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:63e46a5be48c79c044d798315c0545b07d8726d5a9cd1b58693e4d606342229b
3
+ size 7953