Collections
Discover the best community collections!
Collections trending this week
-
EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes
Paper • 2507.11407 • Published • 62 -
LGAI-EXAONE/EXAONE-4.0-1.2B
Text Generation • 1B • Updated • 21.2k • 185 -
LGAI-EXAONE/EXAONE-4.0-32B
Text Generation • 32B • Updated • 30k • 281 -
LGAI-EXAONE/EXAONE-4.0-1.2B-FP8
Text Generation • 1B • Updated • 1.13k • 12
-
deepcogito/cogito-v2-preview-deepseek-671B-MoE
Text Generation • 671B • Updated • 21 • 37 -
deepcogito/cogito-v2-preview-llama-405B
Text Generation • 406B • Updated • 17 • 14 -
deepcogito/cogito-v2-preview-llama-109B-MoE
Image-Text-to-Text • 109B • Updated • 128 • 34 -
deepcogito/cogito-v2-preview-llama-70B
Text Generation • 71B • Updated • 52 • 25
-
spiral-rl/Spiral-Qwen3-4B
Text Generation • 4B • Updated • 95 • 4 -
spiral-rl/Spiral-DeepSeek-R1-Distill-Qwen-7B
Text Generation • 8B • Updated • 4 • 2 -
spiral-rl/Spiral-Kuhn-Poker-Qwen3-32B-SFT
Viewer • Updated • 25.5k • 11 -
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
Paper • 2506.24119 • Published • 51
-
deepcogito/cogito-v2-preview-deepseek-671B-MoE
Text Generation • 671B • Updated • 21 • 37 -
deepcogito/cogito-v2-preview-llama-405B
Text Generation • 406B • Updated • 17 • 14 -
deepcogito/cogito-v2-preview-llama-109B-MoE
Image-Text-to-Text • 109B • Updated • 128 • 34 -
deepcogito/cogito-v2-preview-llama-70B
Text Generation • 71B • Updated • 52 • 25
-
EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes
Paper • 2507.11407 • Published • 62 -
LGAI-EXAONE/EXAONE-4.0-1.2B
Text Generation • 1B • Updated • 21.2k • 185 -
LGAI-EXAONE/EXAONE-4.0-32B
Text Generation • 32B • Updated • 30k • 281 -
LGAI-EXAONE/EXAONE-4.0-1.2B-FP8
Text Generation • 1B • Updated • 1.13k • 12
-
spiral-rl/Spiral-Qwen3-4B
Text Generation • 4B • Updated • 95 • 4 -
spiral-rl/Spiral-DeepSeek-R1-Distill-Qwen-7B
Text Generation • 8B • Updated • 4 • 2 -
spiral-rl/Spiral-Kuhn-Poker-Qwen3-32B-SFT
Viewer • Updated • 25.5k • 11 -
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
Paper • 2506.24119 • Published • 51