Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2 • Image-Text-to-Text • 28B • Updated Apr 6 • 631k • 122
Absolute Zero: Reinforced Self-play Reasoning with Zero Data • Paper • 2505.03335 • Published May 6, 2025 • 192