AI & ML interests
LLMs
Organizations
None yet
ZHLiu627/warm_start_sft_v2 • Preview • 4
ZHLiu627/sciworld_dataset • Preview • 7
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1 • Viewer • 29.3k • 1
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1_v1 • Viewer • 29.3k • 1 • 1
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1 • Viewer • 29.3k • 1
ZHLiu627/updated-code-qwen7-edufiltered • Viewer • 43k • 2
ZHLiu627/updated-code-qwen7-edu • Viewer • 75.6k • 1
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2filtered • Viewer • 28.9k • 2
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2 • Viewer • 29.3k • 1
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filteredd • Viewer • 29.3k • 2
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1filtered • Viewer • 29.1k • 1
ZHLiu627/qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2 • Viewer • 29.3k • 1
ZHLiu627/qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1 • Viewer • 29.3k • 1
Viewer • 118k • 2
ZHLiu627/ultrafeedback_binarized_with_response_full • Viewer • 61.1k • 2
ZHLiu627/ultrafeedback_binarized_with_response_full_part2 • Viewer • 21.1k • 1
ZHLiu627/ultrafeedback_binarized_with_response_full_part1 • Viewer • 20k • 1 • 1
ZHLiu627/ultrafeedback_binarized_with_response_full_part0 • Viewer • 20k • 1