hamishivi/1309_rl_rag_sft_mcp_rubric_lfrs_NAR8.5__1__1758928287_step_350 8B • Updated Oct 1, 2025 • 1
hamishivi/1309_rl_rag_sft_mcp_rubric_lfrs_NAR7.6__1__1758525156_step_400 8B • Updated Oct 1, 2025 • 1
hamishivi/1309_rl_rag_sft_mcp_rubric_lfrs_NAR8.5__1__1758525087_step_200 8B • Updated Oct 1, 2025 • 1
hamishivi/1309_rl_rag_sft_mcp_rubric_lfrs_NAR7.5__1__1758524910_step_400 8B • Updated Oct 1, 2025 • 1
hamishivi/rl-rag-qwen3-32b-sft-mix-0921_no_simple_short_form__123__1759088284 Updated Sep 28, 2025 • 1
hamishivi/1309_rl_rag_sft_mcp_all_apaptive_rubric__1__1758491414_step_700 8B • Updated Sep 28, 2025 • 1