bashanyeyu/olmo3-190m-zh-full-sft

SFT(有监督微调)版本:基于 bashanyeyu/olmo3-190m-zh-full-sft, 使用对话格式数据进行微调,学习指令遵循能力。

数据来源

训练配置

  • LR:5e-05(低 LR 避免灾难性遗忘)
  • Warmup:5.0%
  • Max Seq Length:2048

用法

from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("bashanyeyu/olmo3-190m-zh-full-sft")
tok = AutoTokenizer.from_pretrained("bashanyeyu/olmo3-190m-zh-full-sft")
Downloads last month
39
Safetensors
Model size
0.2B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for bashanyeyu/olmo3-190m-zh-full-sft

Unable to build the model tree, the base model loops to the model itself. Learn more.

Space using bashanyeyu/olmo3-190m-zh-full-sft 1