·
AI & ML interests
LLMs
Organizations
None yet
Updated • 425
ZHLiu627/warm_start_sft_v2
Preview
• Updated • 5
ZHLiu627/sciworld_dataset
Preview
• Updated • 4
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1
Viewer
• Updated • 29.3k • 5
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1_v1
Viewer
• Updated • 29.3k • 21
• 1
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1
Viewer
• Updated • 29.3k • 32
ZHLiu627/updated-code-qwen7-edufiltered
Viewer
• Updated • 43k • 13
ZHLiu627/updated-code-qwen7-edu
Viewer
• Updated • 75.6k • 34
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2filtered
Viewer
• Updated • 28.9k • 12
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2
Viewer
• Updated • 29.3k • 18
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filteredd
Viewer
• Updated • 29.3k • 4
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1filtered
Viewer
• Updated • 29.1k • 10
ZHLiu627/qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2
Viewer
• Updated • 29.3k • 51
ZHLiu627/qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1
Viewer
• Updated • 29.3k • 10
Viewer
• Updated • 118k • 7
ZHLiu627/ultrafeedback_binarized_with_response_full
Viewer
• Updated • 61.1k • 5
ZHLiu627/ultrafeedback_binarized_with_response_full_part2
Viewer
• Updated • 21.1k • 36
ZHLiu627/ultrafeedback_binarized_with_response_full_part1
Viewer
• Updated • 20k • 7
• 1
ZHLiu627/ultrafeedback_binarized_with_response_full_part0
Viewer
• Updated • 20k • 40