QuantTrio/Qwen3-235B-A22B-Instruct-2507-AWQ Text Generation • 235B • Updated Aug 19, 2025 • 20.4k • 12
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • 71B • Updated Feb 24, 2025 • 288k • • 781
Group-in-Group Policy Optimization for LLM Agent Training Paper • 2505.10978 • Published May 16, 2025 • 23