quanjun/OLMo3-190M-zh-full-continue
持续预训练版本:基于 quanjun/OLMo3-190M-zh-full-continue, 在 cmz1024/llm101-olmo3-zh-demo-data/continue 数据上继续训练。
训练配置
- 数据:cmz1024/llm101-olmo3-zh-demo-data/continue/full_v31.bin
- LR:2e-4(低 LR 防止灾难性遗忘)
- Warmup:10%
用法
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("quanjun/OLMo3-190M-zh-full-continue")
tok = AutoTokenizer.from_pretrained("quanjun/OLMo3-190M-zh-full-continue")
- Downloads last month
- 10
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for quanjun/OLMo3-190M-zh-full-continue
Unable to build the model tree, the base model loops to the model itself. Learn more.