rl_math_rl_ready_models_dapo_2812

RL checkpoint uploaded from local storage (models only).

  • Source: /shared/home/yizhan/math-rl/ready_models/dapo_2812
  • Type: full
  • Uploaded: 2026-05-21T15:05:11.804515+00:00
Downloads last month
15
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support