Zhihan Liu's picture

5

Zhihan Liu

ZHLiu627

·

AI & ML interests

LLMs

Organizations

None yet

ZHLiu627 's datasets 20

ZHLiu627/logger-a100

Updated Aug 1, 2025 • 425

ZHLiu627/logger-h100

Updated Aug 1, 2025 • 98

ZHLiu627/warm_start_sft_v2

Preview • Updated Aug 1, 2025 • 5

ZHLiu627/sciworld_dataset

Preview • Updated Aug 1, 2025 • 4

ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1

Viewer • Updated Feb 27, 2025 • 29.3k • 5

ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1_v1

Viewer • Updated Feb 27, 2025 • 29.3k • 21 • 1

ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1

Viewer • Updated Feb 22, 2025 • 29.3k • 32

ZHLiu627/updated-code-qwen7-edufiltered

Viewer • Updated Feb 21, 2025 • 43k • 13

ZHLiu627/updated-code-qwen7-edu

Viewer • Updated Feb 21, 2025 • 75.6k • 34

ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2filtered

Viewer • Updated Feb 19, 2025 • 28.9k • 12

ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2

Viewer • Updated Feb 19, 2025 • 29.3k • 18

ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filteredd

Viewer • Updated Feb 19, 2025 • 29.3k • 4

ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1filtered

Viewer • Updated Feb 19, 2025 • 29.1k • 10

ZHLiu627/qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2

Viewer • Updated Feb 18, 2025 • 29.3k • 51

ZHLiu627/qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1

Viewer • Updated Feb 18, 2025 • 29.3k • 10

ZHLiu627/code-opc2-edu

Viewer • Updated Feb 8, 2025 • 118k • 7

ZHLiu627/ultrafeedback_binarized_with_response_full

Viewer • Updated Mar 8, 2024 • 61.1k • 5

ZHLiu627/ultrafeedback_binarized_with_response_full_part2

Viewer • Updated Mar 8, 2024 • 21.1k • 36

ZHLiu627/ultrafeedback_binarized_with_response_full_part1

Viewer • Updated Mar 8, 2024 • 20k • 7 • 1

ZHLiu627/ultrafeedback_binarized_with_response_full_part0

Viewer • Updated Mar 7, 2024 • 20k • 40