davidanugraha committed
Commit d08fa3f · verified · 1 Parent(s): f6461dc

Update README.md

Files changed (1): README.md (+6 -31)
README.md CHANGED
@@ -1,36 +1,22 @@
  ---
  library_name: transformers
- license: other
- base_model: davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage1
+ license: cc-by-4.0
+ base_model: Qwen3-4B-Concise-SimPO-StrategyB-Stage1
  tags:
  - llama-factory
  - full
  - generated_from_trainer
  model-index:
- - name: concise_phase2_short_qwen3_4b_config1_new
+ - name: Qwen3-4B-Concise-SimPO-StrategyB-Stage2
    results: []
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- # concise_phase2_short_qwen3_4b_config1_new
+ # Qwen3-4B-Concise-SimPO-StrategyB-Stage1

- This model is a fine-tuned version of [davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage1](https://huggingface.co/davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage1) on the dpo_concise_phase2_short dataset.
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
+ This model is a fine-tuned version of [davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage1](https://huggingface.co/davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage1), in which the base model is Qwen/Qwen3-4B.

  ### Training hyperparameters

@@ -47,15 +33,4 @@ The following hyperparameters were used during training:
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_ratio: 0.1
- - num_epochs: 2.0
-
- ### Training results
-
-
-
- ### Framework versions
-
- - Transformers 4.52.4
- - Pytorch 2.7.1
- - Datasets 3.6.0
- - Tokenizers 0.21.1
+ - num_epochs: 2.0
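For reference, the hyperparameters visible in the second hunk map onto `transformers.TrainingArguments` roughly as sketched below. This is an illustrative approximation, not the repository's actual LLaMA-Factory configuration; settings outside the visible hunk (learning rate, batch size, SimPO-specific options) are omitted, and the `output_dir` is hypothetical.

```python
# Hedged sketch: the hyperparameters listed in the diff expressed as
# transformers.TrainingArguments. LLaMA-Factory builds these from its own
# YAML config, so this is only an approximation for orientation.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="qwen3-4b-concise-simpo-stage2",  # hypothetical output path
    optim="adamw_torch",          # OptimizerNames.ADAMW_TORCH
    adam_beta1=0.9,               # betas=(0.9, 0.999)
    adam_beta2=0.999,
    adam_epsilon=1e-8,            # epsilon=1e-08
    lr_scheduler_type="linear",
    warmup_ratio=0.1,             # lr_scheduler_warmup_ratio: 0.1
    num_train_epochs=2.0,
    # Learning rate, batch size, and preference-optimization settings are not
    # shown in the visible hunk, so they are left at their defaults here.
)
```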
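Since the card keeps `library_name: transformers`, a minimal inference sketch follows. The repository id is assumed from the model-index name added in this commit and may not match the actual repo path.

```python
# Minimal inference sketch. The repo id below is an assumption based on the
# model-index name (Qwen3-4B-Concise-SimPO-StrategyB-Stage2), not confirmed
# by this commit page.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage2"  # assumed
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")

messages = [{"role": "user", "content": "Explain SimPO in one sentence."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```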