
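The model in this directory is the Actor model I obtained by starting from https://huggingface.co/wh-zhu/qwen2.5-1.5b-cot and training it with the verl framework, using the PPO algorithm on the GSM8K dataset for 50 steps.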

For the corresponding training parameters, see: https://wandb.ai/bohuang/qwen_2_5_1_5b_cot_PPO/runs/65v9s1wo?nw=nwuserbohuang
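This card does not say how the PPO reward was computed. As a rough illustration only, PPO runs on GSM8K often use a rule-based correctness reward: extract the final number after the `####` marker that GSM8K reference solutions use, and compare it with the ground truth. The sketch below assumes that setup; the function names `extract_answer` and `gsm8k_reward` are hypothetical and are not taken from this run's configuration or from verl's API.

```python
import re
from typing import Optional

# Assumption: the response ends its solution with "#### <number>",
# mirroring the format of GSM8K reference solutions.
_ANSWER_RE = re.compile(r"####\s*(-?[0-9][0-9.,]*)")


def extract_answer(text: str) -> Optional[str]:
    """Return the last '#### <number>' value in text, or None if absent."""
    matches = _ANSWER_RE.findall(text)
    if not matches:
        return None
    # Drop thousands separators and a trailing sentence period.
    return matches[-1].replace(",", "").rstrip(".")


def gsm8k_reward(response: str, ground_truth: str) -> float:
    """Hypothetical rule-based reward: 1.0 for an exact match, else 0.0."""
    predicted = extract_answer(response)
    if predicted is None:
        return 0.0
    target = ground_truth.replace(",", "").strip()
    return 1.0 if predicted == target else 0.0
```

The comparison is a plain string match, so `72.0` and `72` would count as different answers; a numeric comparison is a reasonable alternative if the model's output format varies.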

Safetensors · Model size: 2B params · Tensor type: BF16