Files changed (1)
  1. README.md +75 -63
README.md CHANGED
@@ -1,64 +1,76 @@
- ---
- library_name: peft
- license: apache-2.0
- base_model: Qwen/Qwen2.5-0.5B-Instruct
- tags:
- - generated_from_trainer
- model-index:
- - name: qwen-finetuned
-   results: []
- datasets:
- - SkunkworksAI/reasoning-0.01
- metrics:
- - accuracy
- language:
- - en
- pipeline_tag: text-generation
- ---
-
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- # qwen-finetuned
-
- This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) on an unknown dataset.
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 2e-05
- - train_batch_size: 2
- - eval_batch_size: 8
- - seed: 42
- - gradient_accumulation_steps: 4
- - total_train_batch_size: 8
- - optimizer: Use OptimizerNames.PAGED_ADAMW_8BIT with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
- - lr_scheduler_type: linear
- - num_epochs: 1
- - mixed_precision_training: Native AMP
-
- ### Training results
-
-
-
- ### Framework versions
-
- - PEFT 0.15.2
- - Transformers 4.51.3
- - Pytorch 2.6.0+cu124
- - Datasets 3.5.0
+ ---
+ library_name: peft
+ license: apache-2.0
+ base_model: Qwen/Qwen2.5-0.5B-Instruct
+ tags:
+ - generated_from_trainer
+ datasets:
+ - SkunkworksAI/reasoning-0.01
+ metrics:
+ - accuracy
+ language:
+ - zho
+ - eng
+ - fra
+ - spa
+ - por
+ - deu
+ - ita
+ - rus
+ - jpn
+ - kor
+ - vie
+ - tha
+ - ara
+ pipeline_tag: text-generation
+ model-index:
+ - name: qwen-finetuned
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # qwen-finetuned
+
+ This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) on an unknown dataset.
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 2e-05
+ - train_batch_size: 2
+ - eval_batch_size: 8
+ - seed: 42
+ - gradient_accumulation_steps: 4
+ - total_train_batch_size: 8
+ - optimizer: Use OptimizerNames.PAGED_ADAMW_8BIT with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+ - lr_scheduler_type: linear
+ - num_epochs: 1
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+
+
+ ### Framework versions
+
+ - PEFT 0.15.2
+ - Transformers 4.51.3
+ - Pytorch 2.6.0+cu124
+ - Datasets 3.5.0
  - Tokenizers 0.21.1
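
The hyperparameter list in this README reports `total_train_batch_size: 8` alongside `train_batch_size: 2` and `gradient_accumulation_steps: 4`. The sketch below shows how those three numbers relate and what a `linear` learning-rate schedule does to `learning_rate: 2e-05`. It is an illustration only, not the training script: the device count of 1, the zero warmup, and the step count of 1000 are assumptions that the README does not state, and the helper names are hypothetical.

```python
# Values copied from the "Training hyperparameters" list in the README.
train_batch_size = 2
gradient_accumulation_steps = 4
learning_rate = 2e-05

# Assumption: a single device. The README does not state the device count.
num_devices = 1


def total_train_batch_size(per_device: int, accumulation_steps: int, devices: int) -> int:
    """Number of examples that contribute to each optimizer step."""
    return per_device * accumulation_steps * devices


def linear_lr(step: int, total_steps: int, base_lr: float, warmup_steps: int = 0) -> float:
    """Optional linear warmup, then linear decay from base_lr down to zero."""
    if warmup_steps > 0 and step < warmup_steps:
        return base_lr * step / warmup_steps
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


effective = total_train_batch_size(train_batch_size, gradient_accumulation_steps, num_devices)
print(f"total_train_batch_size = {effective}")

# Hypothetical step count, chosen only to make the decay visible.
hypothetical_total_steps = 1000
for step in (0, 500, 1000):
    lr = linear_lr(step, hypothetical_total_steps, learning_rate)
    print(f"step {step:4d}: lr = {lr:.2e}")
```

With one device, 2 × 4 × 1 reproduces the reported total of 8. On more devices the same per-device settings would give a proportionally larger total.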