DatPySci/RLVR-SGDM-Gap
Safetensors
Branch: main
Path: RLVR-SGDM-Gap/SFT/Qwen2.5-3B-Instruct-s1k_32
Size: 105 GB
1 contributor
History: 1 commit
Latest commit: c065dbd "upload sft" by DatPySci (verified), about 1 month ago
Name                                Size     Last commit  Updated
global_step_124                     -        upload sft   about 1 month ago
global_step_186                     -        upload sft   about 1 month ago
global_step_248                     -        upload sft   about 1 month ago
global_step_310                     -        upload sft   about 1 month ago
global_step_372                     -        upload sft   about 1 month ago
global_step_434                     -        upload sft   about 1 month ago
global_step_496                     -        upload sft   about 1 month ago
global_step_62                      -        upload sft   about 1 month ago
latest_checkpointed_iteration.txt   3 Bytes  upload sft   about 1 month ago