secmlr/patching_mcts_hard
Viewer
• Updated • 7.3k • 5
secmlr/patching_mcts_soft
Viewer
• Updated • 7.31k • 3
secmlr/ossfuzz_dataset_ds_correct_direct_QwQ-32B_train_len_32000_inputlen_16000
Viewer
• Updated • 139 • 2
secmlr/clean_dataset_ds_correct_direct_QwQ-32B_train_len_32000_inputlen_16000
Viewer
• Updated • 5.08k • 2
secmlr/noisy_dataset_ds_correct_direct_QwQ-32B_small_train_len_32000_inputlen_16000
Viewer
• Updated • 2.72k • 2
secmlr/noisy_dataset_ds_correct_together-deepseek-reasoner_small_train_len_32000_inputlen_16000
Viewer
• Updated • 2.91k • 4
secmlr/clean_dataset_ds_correct_together-deepseek-reasoner_train_len_32000_inputlen_16000
Viewer
• Updated • 5.14k • 8
secmlr/clean_dataset_ds_correct_QwQ-32B_train_len_32000_inputlen_16000
Viewer
• Updated • 5.08k • 4
secmlr/noisy_dataset_ds_correct_QwQ-32B_small_train_len_32000_inputlen_16000
Viewer
• Updated • 2.72k • 4
secmlr/clean_dataset_dsformat_filtered_QwQ-32B_train_len_32000_inputlen_16000
Viewer
• Updated • 5.95k • 5
secmlr/ossfuzz_dataset_filtered_QwQ-32B_train_len_32000_inputlen_16000
Viewer
• Updated • 251 • 2
Viewer
• Updated • 3.61k • 5
secmlr/reduced_noisy_dataset_filtered_QwQ-32B-Preview_small_train_len_8000_inputlen_5000
Viewer
• Updated • 3.95k • 5
secmlr/reduced_clean_dataset_filtered_together-deepseek-reasoner_train_len_8000_inputlen_5000
Viewer
• Updated • 4.98k • 4
secmlr/reduced_clean_dataset_filtered_QwQ-32B-Preview_train_len_8000_inputlen_5000
Viewer
• Updated • 5.16k • 2
Viewer
• Updated • 517k • 15
secmlr/prm_clean_dataset_filtered_QwQ-32B-Preview_train_len_8000_inputlen_5000
Viewer
• Updated • 1.37k • 4
secmlr/VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5_train_dpo
Viewer
• Updated • 5.45k • 2
secmlr/noisy_dataset_filtered_QwQ-32B-Preview_small_train_len_16000_inputlen_5000
Viewer
• Updated • 3.96k • 2
secmlr/noisy_dataset_filtered_QwQ-32B-Preview_small_train_len_8000_inputlen_5000
Viewer
• Updated • 3.95k • 3
secmlr/Sky-T1_preference_data_10k_filtered
Viewer
• Updated • 9.39k • 4
secmlr/VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5_train_dpo_filtered
Viewer
• Updated • 5.72k • 3
secmlr/Sky-T1_data_17k_filtered
Viewer
• Updated • 14.6k • 6
secmlr/clean_dataset_filtered_together-deepseek-reasoner_train_len_8000_inputlen_5000
Viewer
• Updated • 4.98k • 3
secmlr/clean_dataset_dsformat_filtered_QwQ-32B-Preview_train_len_8000_inputlen_5000
Viewer
• Updated • 5.16k • 4
secmlr/clean_dataset_dsformat_filtered_QwQ-32B-Preview_train_len_16000_inputlen_5000
Viewer
• Updated • 5.16k • 3
secmlr/clean_dataset_filtered_together-deepseek-reasoner_train_len_16000_inputlen_5000
Viewer
• Updated • 4.99k • 4
secmlr/clean_dataset_dsformat_filtered_together-deepseek-reasoner_train_len_8000_inputlen_5000
Viewer
• Updated • 4.98k • 3
secmlr/clean_dataset_dsformat_filtered_together-deepseek-reasoner_train_len_16000_inputlen_5000
Viewer
• Updated • 4.99k • 4
secmlr/clean_dataset_filtered_QwQ-32B-Preview_train_len_16000_inputlen_5000
Viewer
• Updated • 5.16k • 6