Ashima/qwen3_0.6b-rlvr_task010_mctaco_answer_generation_event_ordering Viewer • Updated Mar 5 • 128 • 4
Ashima/qwen3_0.6b-rlvr_task003_mctaco_question_generation_event_duration Viewer • Updated Mar 5 • 128 • 3
Ashima/qwen3_0.6b-rlvr_Feb24-1812_datamix_top10_augmented_needle_in_a_haystack_Feb25-0119 Viewer • Updated Feb 25 • 400 • 4
Ashima/qwen3_0.6b-rlvr_Feb24-1812_datamix_top10_augmented_finding_errors_in_reasoning_traces_Feb25-0038 Viewer • Updated Feb 25 • 400 • 4
Ashima/qwen3_0.6b-rlvr_Feb24-1812_datamix_top10_augmented_inductive_reasoning_Feb25-0035 Viewer • Updated Feb 25 • 400 • 3
Ashima/qwen3_0.6b-rlvr_Feb24-1812_datamix_top10_augmented_constraint_satisfaction_Feb25-0017 Viewer • Updated Feb 25 • 400 • 5
Ashima/qwen3_0.6b-rlvr_Feb24-1812_datamix_top10_augmented_compositional_understanding_Feb25-0012 Viewer • Updated Feb 25 • 400 • 4
Ashima/qwen3_0.6b-rlvr_Feb24-1812_datamix_top10_augmented_knowledge-intensive_reasoning_Feb25-0006 Viewer • Updated Feb 25 • 400 • 5
Ashima/qwen3_0.6b-rlvr_Feb24-1812_datamix_top10_augmented_Feb24-1927 Viewer • Updated Feb 25 • 400 • 1
Ashima/qwen3_0.6b-rlvr_Feb24-1812_datamix_top10_augmented_Feb24-1915 Viewer • Updated Feb 25 • 400 • 2
Ashima/qwen3_0.6b-rlvr_Feb24-1812_datamix_top10_augmented_Feb24-1909 Viewer • Updated Feb 25 • 400 • 2
Ashima/qwen3_0.6b-rlvr_Feb24-1812_datamix_top10_augmented_Feb24-1857 Viewer • Updated Feb 25 • 400 • 8
Ashima/qwen3_0.6b-rlvr_task967_ruletaker_incorrect_fact_generation_based_on_given_paragraph Viewer • Updated Feb 24 • 296 • 2