Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

DatPySci
/
RLVR-SGDM-Gap

Safetensors
Model card Files Files and versions
xet
Community
RLVR-SGDM-Gap / SFT /Qwen2.5-3B-Instruct-s1k_32
105 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
DatPySci's picture
DatPySci
upload sft
c065dbd verified about 1 month ago
  • global_step_124
    upload sft about 1 month ago
  • global_step_186
    upload sft about 1 month ago
  • global_step_248
    upload sft about 1 month ago
  • global_step_310
    upload sft about 1 month ago
  • global_step_372
    upload sft about 1 month ago
  • global_step_434
    upload sft about 1 month ago
  • global_step_496
    upload sft about 1 month ago
  • global_step_62
    upload sft about 1 month ago
  • latest_checkpointed_iteration.txt
    3 Bytes
    upload sft about 1 month ago