Collections
Discover the best community collections!
Collections trending this week
-
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Paper • 2603.19220 • Published • 69
Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR
Paper • 2605.20164 • Published • 6
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment
Paper • 2605.19577 • Published • 59
EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL
Paper • 2605.18703 • Published • 50
-
ai-sage/GigaChat3.1-702B-A36B-GGUF
Text Generation • 702B • Updated • 385 • 16
ai-sage/GigaChat3.1-702B-A36B
Text Generation • 715B • Updated • 1.41k • 29
ai-sage/GigaChat3.1-702B-A36B-bf16
Text Generation • 715B • Updated • 27 • 6
ai-sage/GigaChat3.1-10B-A1.8B-GGUF
Text Generation • 11B • Updated • 3.55k • 74
-
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning
Paper • 2603.17024 • Published • 110
WorldAgents: Can Foundation Image Models be Agents for 3D World Models?
Paper • 2603.19708 • Published • 13
MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data
Paper • 2603.25319 • Published • 32
ArtHOI: Taming Foundation Models for Monocular 4D Reconstruction of Hand-Articulated-Object Interactions
Paper • 2603.25791 • Published • 7