hamishivi/rl_rag_AR4_cb_rar_2k_norm_test_buffer__1__1758180701_step_200 8B • Updated Sep 20, 2025 • 1
hamishivi/rl_rag_AR4_cb_rar_2k_norm_test_buffer__1__1758180701_step_150 8B • Updated Sep 19, 2025 • 1
hamishivi/rl_rag_AR4_cb_rar_2k_norm_test_buffer__1__1758180701_step_100 8B • Updated Sep 19, 2025 • 2
hamishivi/0409_rl_rag_sft_mcp_bigger_batch_long__1__1757718571_step_300 8B • Updated Sep 15, 2025 • 1
hamishivi/0409_rl_rag_sft_mcp_bigger_batch_long__1__1757718571_step_100 8B • Updated Sep 13, 2025 • 1