rl_math_rl_ready_models_dapo_0421

RL checkpoint uploaded from local storage (models only).

  • Source: /shared/home/yizhan/math-rl/ready_models/dapo_0421
  • Type: full
  • Uploaded: 2026-05-21T15:14:46.715700+00:00
Downloads last month
10
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support