davidanugraha committed
Commit d08fa3f · verified · 1 Parent(s): f6461dc

Update README.md

Files changed (1): README.md (+6 -31)
README.md CHANGED
@@ -1,36 +1,22 @@
  ---
  library_name: transformers
- license: other
- base_model: davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage1
+ license: cc-by-4.0
+ base_model: Qwen3-4B-Concise-SimPO-StrategyB-Stage1
  tags:
  - llama-factory
  - full
  - generated_from_trainer
  model-index:
- - name: concise_phase2_short_qwen3_4b_config1_new
+ - name: Qwen3-4B-Concise-SimPO-StrategyB-Stage2
    results: []
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- # concise_phase2_short_qwen3_4b_config1_new
+ # Qwen3-4B-Concise-SimPO-StrategyB-Stage1

- This model is a fine-tuned version of [davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage1](https://huggingface.co/davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage1) on the dpo_concise_phase2_short dataset.
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
+ This model is a fine-tuned version of [davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage1](https://huggingface.co/davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage1), in which the base model is Qwen/Qwen3-4B.

  ### Training hyperparameters

@@ -47,15 +33,4 @@ The following hyperparameters were used during training:
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_ratio: 0.1
- - num_epochs: 2.0
-
- ### Training results
-
-
-
- ### Framework versions
-
- - Transformers 4.52.4
- - Pytorch 2.7.1
- - Datasets 3.6.0
- - Tokenizers 0.21.1
+ - num_epochs: 2.0
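For reference, the hyperparameters visible in the second hunk map onto `transformers.TrainingArguments` roughly as sketched below. This is an illustrative approximation, not the repository's actual LLaMA-Factory configuration; settings outside the visible hunk (learning rate, batch size, SimPO-specific options) are omitted, and the `output_dir` is hypothetical.

```python
# Hedged sketch: the hyperparameters listed in the diff expressed as
# transformers.TrainingArguments. LLaMA-Factory builds these from its own
# YAML config, so this is only an approximation for orientation.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="qwen3-4b-concise-simpo-stage2",  # hypothetical output path
    optim="adamw_torch",          # OptimizerNames.ADAMW_TORCH
    adam_beta1=0.9,               # betas=(0.9, 0.999)
    adam_beta2=0.999,
    adam_epsilon=1e-8,            # epsilon=1e-08
    lr_scheduler_type="linear",
    warmup_ratio=0.1,             # lr_scheduler_warmup_ratio: 0.1
    num_train_epochs=2.0,
    # Learning rate, batch size, and preference-optimization settings are not
    # shown in the visible hunk, so they are left at their defaults here.
)
```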
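Since the card keeps `library_name: transformers`, a minimal inference sketch follows. The repository id is assumed from the model-index name added in this commit and may not match the actual repo path.

```python
# Minimal inference sketch. The repo id below is an assumption based on the
# model-index name (Qwen3-4B-Concise-SimPO-StrategyB-Stage2), not confirmed
# by this commit page.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "davidanugraha/Qwen3-4B-Concise-SimPO-StrategyB-Stage2"  # assumed
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")

messages = [{"role": "user", "content": "Explain SimPO in one sentence."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```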