YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
这个目录下的模型是我利用 https://huggingface.co/wh-zhu/qwen2.5-1.5b-cot 使用 verl 框架, 使用 PPO 算法在 GSM8K 数据集上训练 50 个 step 得到的 Actor 模型.
对应的参数见: https://wandb.ai/bohuang/qwen_2_5_1_5b_cot_PPO/runs/65v9s1wo?nw=nwuserbohuang
- Downloads last month
- 4
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support