rl-rag/tqa_rlvr_no_prompt_f1_test
Viewer
• Updated
• 17.9k • 6
rl-rag/hotpotqa_rlvr_no_prompt_f1_test
Viewer
• Updated
• 7.41k • 4
rl-rag/2wiki_rlvr_no_prompt_f1_test
Viewer
• Updated
• 300 • 11
rl-rag/asearcher_short_form_rlvr_with_system_prompt
Viewer
• Updated
• 70.6k • 6
rl-rag/verified_miro_trajectories
Viewer
• Updated
• 9.88k • 9
rl-rag/rl_rag_sqa_openscholar_rubrics_s2_augmented_longform_averaged_outcome_with_system_prompt
Viewer
• Updated
• 2.42k • 5
rl-rag/combined-sft-training-data-v20250824_MiroSystemPrompt
Viewer
• Updated
• 4.44k • 6
Viewer
• Updated
• 3.99k • 14
rl-rag/rl_rag_sqa_no_retrieval_1k_longform_finegrained_with_system_prompt
Viewer
• Updated
• 999 • 5
rl-rag/rl_rag_sqa_no_retrieval_1k_longform_averaged_outcome_with_system_prompt
Viewer
• Updated
• 999 • 5
rl-rag/rl_rag_no_retrieval_1k_longform_rubrics_only_with_system_prompt
Viewer
• Updated
• 999 • 5
rl-rag/gpt-oss-20b-eval-react-serper
Updated
• 32
rl-rag/verifiable_synthetic_1k_0814
Viewer
• Updated
• 1.05k • 4
rl-rag/verifiable_synthetic_varied_depth_o3_verified
Viewer
• Updated
• 101 • 5
rl-rag/verifiable_synthetic_depth_one_v2_verified
Viewer
• Updated
• 114 • 8
rl-rag/combined-sft-training-data-v20250724
Viewer
• Updated
• 568 • 10
rl-rag/qwq_32b_factualqa_sft_data
Viewer
• Updated
• 36.5k • 8