| ---
|
| base_model: complexly/olmo3-190m-zh-sft
|
| license: apache-2.0
|
| language:
|
| - zh
|
| tags:
|
| - llm001
|
| - olmo3
|
| - chinese
|
| - sft
|
| - supervised-finetuning
|
| ---
|
|
|
| # complexly/olmo3-190m-zh-sft
|
|
|
| SFT(有监督微调)版本:基于complexly/olmo3-190m-zh-continue,
|
| 使用对话格式数据进行微调,学习指令遵循能力。
|
|
|
| ## 数据来源
|
| - 训练数据:cmz1024/llm101-olmo3-zh-demo-data
|
| - 子路径:sft/sft_t2t_mini.jsonl
|
|
|
| ## 训练配置
|
| - Learning Rate:5.0e-5
|
| - Warmup:5%
|
| - Epochs:3 epoch
|
| - Max Seq Length:2048
|
| - 使用 assistant_only_loss(仅对 assistant 部分计算 loss)
|
| - per_device_train_batch_size: 24
|
| - packing: true
|
|
|
| ## 用法
|
| ```python
|
| from transformers import AutoModelForCausalLM, AutoTokenizer
|
| from transformers import pipeline
|
|
|
| model = AutoModelForCausalLM.from_pretrained("{target_repo}")
|
| tok = AutoTokenizer.from_pretrained("{target_repo}")
|
|
|
| # 使用 chat template
|
| messages = [{{"role": "user", "content": "你好,请介绍一下北京"}}]
|
| inputs = tok.apply_chat_template(messages, return_tensors="pt")
|
| outputs = model.generate(inputs, max_new_tokens=200)
|
| print(tok.decode(outputs[0]))
|
| ```
|
|
|