rl-rag/drtulu_v2_arena_expert
Viewer
• Updated
• 2.74k • 14
rl-rag/drtulu_v2_personalfinance
Viewer
• Updated
• 18.8k • 13
rl-rag/drtulu_v2_nemotron_web_search_mcqa
Viewer
• Updated
• 1.95k • 10
rl-rag/1_sample_toy_rag_survey
Viewer
• Updated
• 8 • 11
Viewer
• Updated
• 30 • 6
rl-rag/rl-rag-RaR-Medicine-3k-o3-mini-converted
Viewer
• Updated
• 3k • 10
rl-rag/dpo_lf_sft0921_rubric_citation
Viewer
• Updated
• 1.32k • 8
rl-rag/sft_rejection_sampled_on_policy_long-_form_sft_0921
Viewer
• Updated
• 2.22k • 8
rl-rag/dpo_long_form_gpt5_sft_0921
Viewer
• Updated
• 3.37k • 13
rl-rag/sft_0921_onpolicy_rejection_sampled
Viewer
• Updated
• 1.9k • 8
rl-rag/dpo_gpt5_our_sft_0921
Preview
• Updated
• 7
rl-rag/dpo_our_sft_0921_two_iterations
Viewer
• Updated
• 705 • 7
rl-rag/sft-mix-v20250921_long_form_only_04
Viewer
• Updated
• 3.5k • 7
rl-rag/sft-mix-v20250921_long_form_only_05
Viewer
• Updated
• 3.5k • 6
rl-rag/sft-mix-v20250921_short_form_only_05
Viewer
• Updated
• 2.5k • 7
rl-rag/hle_rlvr_no_prompt
Viewer
• Updated
• 500 • 267
rl-rag/searcharena_query_scores
Viewer
• Updated
• 8.78k • 7
rl-rag/sft-mix-v20250921_short_form_only
Viewer
• Updated
• 5.8k • 8
rl-rag/sft-mix-v20250921_long_form_only
Viewer
• Updated
• 10.3k • 29
rl-rag/sft-mix-v20250921_05
Viewer
• Updated
• 8k • 9
rl-rag/sft-mix-v20250921_02
Viewer
• Updated
• 3.2k • 7
rl-rag/sft-mix-v20250921_01
Viewer
• Updated
• 1.6k • 10
rl-rag/sft-mix-v20250921_005
Viewer
• Updated
• 800 • 8
rl-rag/rl_rag_train_sa_3k_longform_rubrics
Viewer
• Updated
• 2.94k • 6
rl-rag/rl_rag_sqa_searcharena_rubrics_web_augmented_rubrics_only_call_tool
Viewer
• Updated
• 2.94k • 5
rl-rag/rl_rag_sqa_searcharena_rubrics_web_augmented_rubrics_only_with_new_mcp_system_prompt
Viewer
• Updated
• 2.94k • 8
rl-rag/rl_rag_sqa_searcharena_rubrics_web_augmented_longform_averaged_outcome_with_system_prompt
Viewer
• Updated
• 2.94k • 6
rl-rag/rl_rag_sqa_searcharena_rubrics_web_augmented_outcome_with_new_mcp_system_prompt
Viewer
• Updated
• 2.94k • 5
rl-rag/gpqa_diamond_rlvr_no_prompt
Viewer
• Updated
• 198 • 63
rl-rag/nq_rlvr_no_prompt_f1_test
Viewer
• Updated
• 3.61k • 5