happynew111/NEW_qwen2_5_MATH_1_5b_grpo_reg_beta_0.1_gspo_bce_4
happynew111/NEW_qwen2_5_MATH_1_5b_grpo_reg_beta_0.1_gspo_bce_3
happynew111/NEW_qwen2_5_MATH_1_5b_grpo_reg_beta_0.1_gspo_bce_2
happynew111/NEW_qwen2_5_MATH_1_5b_grpo_reg_beta_0.1_gspo_bce_1
happynew111/NEW_qwen2_5_MATH_1_5b_grpo_reg_beta_0.1_gpg_bce_5
happynew111/NEW_qwen2_5_MATH_1_5b_grpo_reg_beta_0.1_gpg_bce_4
happynew111/NEW_qwen2_5_MATH_1_5b_grpo_reg_beta_0.1_gpg_bce_3
happynew111/NEW_qwen2_5_MATH_1_5b_grpo_reg_beta_0.1_gpg_bce_2
happynew111/NEW_qwen2_5_MATH_1_5b_grpo_reg_beta_0.1_gpg_bce_1
happynew111/haotian_data-GPS-AR-Lopti-master
happynew111/haotian_data-GPS-Model
happynew111/haotian_data-GPS-CCGSPG_for_me
happynew111/haotian_data-GPS-CCGSPG_for_me_second
happynew111/haotian_data-GPS-GAINRL-main
happynew111/haotian_data-GPS-lm-evaluation-harness
happynew111/haotian_data-GPS-verl-main-CL
happynew111/haotian_data-GPS-verl-main
happynew111/MATH_BS_BCE_train_json
happynew111/MATH_BS_BCE_valid_log_json
happynew111/AR_train_json
happynew111/AR_valid_log_json
happynew111/MATH_train_json
happynew111/MATH_valid_log_json