Kazuki1450/Olmo-3-1025-7B_dsum_3_6_1p0_0p5_1p0_grpo_42_rule • Text Generation • 7B • Updated about 1 hour ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_1p0_0p2_1p0_grpo_42_rule • Text Generation • 7B • Updated about 2 hours ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_sgnrel_up_1e0_1p0_0p0_1p0_grpo_42_rule • Text Generation • 7B • Updated about 2 hours ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_tok_python_alt_1_per_5_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 2 hours ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_tok_python_alt_1_per_10_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 2 hours ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_alt_1_per_5_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 3 hours ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_sgnrel_up_1e-1_1p0_0p0_1p0_grpo_42_rule • Text Generation • 7B • Updated about 3 hours ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_sgnrel_up_1e1_1p0_0p0_1p0_grpo_42_rule • Text Generation • 7B • Updated about 3 hours ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_alt_1_per_2_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 4 hours ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_tok_python_alt_1_per_2_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 4 hours ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_1p0_0p5_1p0_grpo_42_rule • Text Generation • 2B • Updated about 6 hours ago