quanjun/OLMo3-190M-zh-full-continue

Continued-pretraining version: based on quanjun/OLMo3-190M-zh-full-continue and trained further on the cmz1024/llm101-olmo3-zh-demo-data/continue data.

Training configuration

  • Data: cmz1024/llm101-olmo3-zh-demo-data/continue/full_v31.bin
  • LR: 2e-4 (a low learning rate, chosen to prevent catastrophic forgetting)
  • Warmup: 10%
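
As a rough illustration of how the LR and warmup settings above interact, here is a minimal sketch of a linear-warmup schedule. It assumes that the 10% refers to the fraction of total training steps and that the rate ramps linearly to the 2e-4 peak. The card does not say what happens after warmup, so this sketch simply holds the peak. The function name `lr_at_step` and the step count in the example are hypothetical.

```python
def lr_at_step(step, total_steps, peak_lr=2e-4, warmup_frac=0.10):
    """Learning rate at a 0-indexed optimizer step.

    Assumptions (not stated in the card): warmup is linear, the 10%
    is a fraction of total steps, and the LR stays at its peak afterwards.
    """
    warmup_steps = max(1, int(total_steps * warmup_frac))
    if step < warmup_steps:
        # Ramp linearly so the last warmup step reaches peak_lr exactly.
        return peak_lr * (step + 1) / warmup_steps
    return peak_lr


# Hypothetical run of 1000 steps: warmup covers the first 100 steps.
schedule = [lr_at_step(s, total_steps=1000) for s in range(1000)]
```

Starting from a small rate keeps the first updates on the new data gentle, which fits the stated goal of limiting catastrophic forgetting.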

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("quanjun/OLMo3-190M-zh-full-continue")
tok = AutoTokenizer.from_pretrained("quanjun/OLMo3-190M-zh-full-continue")
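
Once the model and tokenizer load, text generation might look like the sketch below. It assumes a PyTorch backend and an installed transformers version that supports this architecture. The helper name `generate_text`, the greedy decoding setting, and the token budget are illustrative choices, not settings taken from the card.

```python
MODEL_ID = "quanjun/OLMo3-190M-zh-full-continue"


def generate_text(prompt, max_new_tokens=50):
    """Greedy-decode a continuation of `prompt` (hypothetical helper)."""
    # Imported here so that defining the helper does not load the library.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tok = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    model.eval()

    # Tokenize to PyTorch tensors and generate deterministically.
    inputs = tok(prompt, return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=False,
    )
    # output_ids[0] contains the prompt tokens followed by the continuation.
    return tok.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text("今天天气")` downloads the weights on first use and returns the prompt followed by the model's continuation. Since this is a 0.2B-parameter base model, expect plain text continuation rather than instruction following.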

Model size: 0.2B params (Safetensors, tensor type BF16)