Deventhedude 's Collections
Viewer
• Updated • 45k • 166
• 62
Viewer
• Updated • 1.87k • 2.08k
• 235
Preview
• Updated • 39
• 52
Viewer
• Updated • 4.11k • 1.02k
• 31
Viewer
• Updated • 52.5B • 253k
• 2.91k
NousResearch/hermes-function-calling-v1
Viewer
• Updated • 11.6k • 24.3k
• 423
agentica-org/DeepScaleR-Preview-Dataset
Viewer
• Updated • 40.3k • 21.6k
• 200
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper
• 2508.06471
• Published • 212
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable
Reinforcement Learning
Paper
• 2507.01006
• Published • 257
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts
Paper
• 2602.13367
• Published • 36
Viewer
• Updated • 842k • 2.83k
• 116
nvidia/Nemotron-RL-agent-calendar_scheduling
Viewer
• Updated • 4k • 284
• 5
nvidia/Nemotron-RL-agent-workplace_assistant
Viewer
• Updated • 1.8k • 1.31k
• 29
nvidia/Nemotron-RL-Agentic-Conversational-Tool-Use-Pivot-v1
Viewer
• Updated • 97k • 5.99k
• 23
nvidia/Nemotron-RL-Agentic-Function-Calling-Pivot-v1
Viewer
• Updated • 9.62k • 1.69k
• 10