zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B zephre 7B-sft---> PPO-7B

Downloads last month
548
Safetensors
Model size
7B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ewqr2130/7B_ppo_phiRM_2GPU_3e-7step_4000

Quantizations
1 model