| *.7z filter=lfs diff=lfs merge=lfs -text | |
| *.arrow filter=lfs diff=lfs merge=lfs -text | |
| *.bin filter=lfs diff=lfs merge=lfs -text | |
| *.bz2 filter=lfs diff=lfs merge=lfs -text | |
| *.ckpt filter=lfs diff=lfs merge=lfs -text | |
| *.ftz filter=lfs diff=lfs merge=lfs -text | |
| *.gz filter=lfs diff=lfs merge=lfs -text | |
| *.h5 filter=lfs diff=lfs merge=lfs -text | |
| *.joblib filter=lfs diff=lfs merge=lfs -text | |
| *.lfs.* filter=lfs diff=lfs merge=lfs -text | |
| *.mlmodel filter=lfs diff=lfs merge=lfs -text | |
| *.model filter=lfs diff=lfs merge=lfs -text | |
| *.msgpack filter=lfs diff=lfs merge=lfs -text | |
| *.npy filter=lfs diff=lfs merge=lfs -text | |
| *.npz filter=lfs diff=lfs merge=lfs -text | |
| *.onnx filter=lfs diff=lfs merge=lfs -text | |
| *.ot filter=lfs diff=lfs merge=lfs -text | |
| *.parquet filter=lfs diff=lfs merge=lfs -text | |
| *.pb filter=lfs diff=lfs merge=lfs -text | |
| *.pickle filter=lfs diff=lfs merge=lfs -text | |
| *.pkl filter=lfs diff=lfs merge=lfs -text | |
| *.pt filter=lfs diff=lfs merge=lfs -text | |
| *.pth filter=lfs diff=lfs merge=lfs -text | |
| *.rar filter=lfs diff=lfs merge=lfs -text | |
| *.safetensors filter=lfs diff=lfs merge=lfs -text | |
| saved_model/**/* filter=lfs diff=lfs merge=lfs -text | |
| *.tar.* filter=lfs diff=lfs merge=lfs -text | |
| *.tar filter=lfs diff=lfs merge=lfs -text | |
| *.tflite filter=lfs diff=lfs merge=lfs -text | |
| *.tgz filter=lfs diff=lfs merge=lfs -text | |
| *.wasm filter=lfs diff=lfs merge=lfs -text | |
| *.xz filter=lfs diff=lfs merge=lfs -text | |
| *.zip filter=lfs diff=lfs merge=lfs -text | |
| *.zst filter=lfs diff=lfs merge=lfs -text | |
| *tfevents* filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:29477\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_ques_keywords_filtered_data_727_ablation/checkpoint-46/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:19929\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_difficulty_1678_random_sample_871_ablation/checkpoint-68/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:19929\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_difficulty_1678_random_sample_871_ablation/checkpoint-78/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/qwen7b_sft_871_checkpoint-78/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:28315\#LR:1e-5\#BASE:DeepSeek-R1-Distill-Qwen-7B\#TOKEN:DeepSeek-R1-Distill-Qwen-7B\#BSZ:2\#ACC:4_no_error_data_871/checkpoint-68/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:30936\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_math_qwq_4524_selected_add_prompt_871/checkpoint-68/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:30936\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_math_qwq_4524_selected_add_prompt_871/checkpoint-55/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:24815\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_difficulty_1678_ablation/checkpoint-156/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:24815\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_difficulty_1678_ablation/checkpoint-131/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/QwQ-32B-sft_1.1k_ckpt91/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:12290\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_no_error_data_871_871_wo_mask/checkpoint-68/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:12290\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_no_error_data_871_871_wo_mask/checkpoint-78/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:17315\#LR:1e-5\#BASE:DeepSeek-R1-Distill-Qwen-32\#TOKEN:DeepSeek-R1-Distill-Qwen-32\#BSZ:2\#ACC:4_no_error_data_871/checkpoint-41/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:2284\#LR:1e-5\#BASE:QwQ-32B\#TOKEN:QwQ-32B\#BSZ:2\#ACC:4_no_error_data_871/checkpoint-55/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:2284\#LR:1e-5\#BASE:QwQ-32B\#TOKEN:QwQ-32B\#BSZ:2\#ACC:4_no_error_data_871/checkpoint-41/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:8557\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_reason_1099_ablation/checkpoint-102/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:\#LR:1e-5\#BASE:QwQ-32B\#TOKEN:QwQ-32B\#BSZ:2\#ACC:4_merged_syn_long_359_sft_1533/checkpoint-528/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:30702\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_format_1064_random_sample_871_ablation/checkpoint-27/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:30702\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_format_1064_random_sample_871_ablation/checkpoint-68/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:30702\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_format_1064_random_sample_871_ablation/checkpoint-55/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:30702\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_format_1064_random_sample_871_ablation/checkpoint-78/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:30702\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_format_1064_random_sample_871_ablation/checkpoint-41/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:14123\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_ques_domain_filtered_data_738_ablation/checkpoint-46/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:5409\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_reason_1099_random_sample_871_ablation/checkpoint-67/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:5409\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_reason_1099_random_sample_871_ablation/checkpoint-78/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:3528\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_subquery_1073_ablation/checkpoint-102/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:15751\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_subquery_1073_random_sample_871_ablation/checkpoint-27/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:15751\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_subquery_1073_random_sample_871_ablation/checkpoint-68/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:15751\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_subquery_1073_random_sample_871_ablation/checkpoint-55/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:15751\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_subquery_1073_random_sample_871_ablation/checkpoint-78/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:15751\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_subquery_1073_random_sample_871_ablation/checkpoint-41/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:11361\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_no_error_data_871/checkpoint-109/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:11361\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_no_error_data_871/checkpoint-81/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:11361\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_no_error_data_871/checkpoint-162/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:11361\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_no_error_data_871/checkpoint-136/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:10634\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_ques_yiwenci_filtered_data_811_ablation/checkpoint-38/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:17483\#LR:1e-5\#BASE:Qwen2.5-32B-Instruct\#TOKEN:Qwen2.5-32B-Instruct\#BSZ:2\#ACC:4_no_error_data_871/checkpoint-55/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:2082\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_format_1064_random_sample_871_ablation/checkpoint-68/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:2082\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_format_1064_random_sample_871_ablation/checkpoint-78/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:31348\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_resp_format_1064_ablation/checkpoint-96/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/qwq_search_sft_2.7k_ckpt211/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| sft_search/JOB:14478\#LR:1e-5\#BASE:Qwen2.5-7B-Instruct\#TOKEN:Qwen2.5-7B-Instruct\#BSZ:2\#ACC:4_ablation_subquery_1073_random_sample_871_ablation/checkpoint-78/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| data/dup_0.json filter=lfs diff=lfs merge=lfs -text | |