Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
ZongzeLi
/
new-ckpts
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
main
new-ckpts
757 GB
1 contributor
History:
556 commits
ZongzeLi
Upload 'depo-math': MATH-3B/checkpoint-157/zero_to_fp32.py
d792a4e
verified
4 months ago
depo-distill-8B
Upload 'depo-distill-8B': checkpoint-187/global_step187/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
4 months ago
depo-math
Upload 'depo-math': MATH-3B/checkpoint-157/zero_to_fp32.py
4 months ago
depo-math_w_KL
Upload 'depo-math_w_KL': MATH-3B/checkpoint-157/global_step156/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
4 months ago
depo-random-mask-thinking_0.2_lr_5.0e-06
Upload 'depo-random-mask-thinking_0.2_lr_5.0e-06': checkpoint-186/global_step186/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
4 months ago
depo-random-mask-thinking_1.0_lr_5.0e-06
Upload 'depo-random-mask-thinking_1.0_lr_5.0e-06': checkpoint-748/generation_config.json
4 months ago
depo-random-no_mask_lr_5.0e-06
Upload 'depo-random-no_mask_lr_5.0e-06': checkpoint-186/global_step186/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
4 months ago
depo-training-noisy-0.1
Upload 'depo-training-noisy-0.1': checkpoint-748/global_step747/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
4 months ago
depo-training-noisy-0.3
Upload 'depo-training-noisy-0.3': checkpoint-748/global_step747/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
4 months ago
depo-training-noisy-0.5
Upload 'depo-training-noisy-0.5': checkpoint-748/global_step747/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
4 months ago
grpo-distill-8B
Upload 'grpo-distill-8B': checkpoint-187/global_step187/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
4 months ago
.gitattributes
4.07 kB
Upload 'depo-math': MATH-3B/checkpoint-157/tokenizer.json
4 months ago