Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

lhl616
/
Qwen3-4B-Base-ppo

Safetensors
qwen3
Model card Files Files and versions
xet
Community
Qwen3-4B-Base-ppo / _critic
4.9 MB
  • 1 contributor
History: 1 commit
lhl616's picture
lhl616
Upload final model from root of Qwen3-4B-Base-ppo
2c3ac7a verified about 1 month ago
  • global_step192
    Upload final model from root of Qwen3-4B-Base-ppo about 1 month ago
  • global_step216
    Upload final model from root of Qwen3-4B-Base-ppo about 1 month ago
  • global_step312
    Upload final model from root of Qwen3-4B-Base-ppo about 1 month ago
  • latest
    14 Bytes
    Upload final model from root of Qwen3-4B-Base-ppo about 1 month ago
  • zero_to_fp32.py
    33.3 kB
    Upload final model from root of Qwen3-4B-Base-ppo about 1 month ago