sachiniyer/Qwen2.5-1.5B-GRPO-PeakCapture-Schwinn-v21 Text Generation • 2B • Updated about 6 hours ago
sachiniyer/Qwen2.5-1.5B-GRPO-HighKL-Scaled-Schwinn-v19 Text Generation • 2B • Updated about 12 hours ago
sachiniyer/Qwen2.5-1.5B-GRPO-HighKL-Schwinn-v16 Text Generation • 2B • Updated about 21 hours ago • 11
sachiniyer/Qwen2.5-1.5B-GRPO-EarlyStop-Schwinn-v20 Text Generation • 2B • Updated about 21 hours ago • 8
sachiniyer/Qwen2.5-1.5B-GRPO-NegReward-Schwinn-v17 Text Generation • 2B • Updated about 22 hours ago • 10
sachiniyer/Qwen2.5-1.5B-GRPO-Count-Schwinn-v15 Text Generation • 2B • Updated about 22 hours ago • 11