Collections
Discover the best community collections!
Collections trending this week
-
mats-10-sprint-cs-jb/cot-oracle-eval-atypical-answer-riya
Viewer • Updated • 100 • 57 -
mats-10-sprint-cs-jb/cot-oracle-eval-cybercrime-ood
Viewer • Updated • 100 • 52 -
mats-10-sprint-cs-jb/cot-oracle-eval-forced-answer-entropy-riya
Viewer • Updated • 200 • 36 -
mats-10-sprint-cs-jb/cot-oracle-eval-reasoning-termination-riya
Viewer • Updated • 84 • 53
-
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
Paper • 2509.01055 • Published • 79 -
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
Paper • 2510.04206 • Published • 3 -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper • 2510.05592 • Published • 108 -
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use
Paper • 2602.02160 • Published • 13
-
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
Paper • 2509.01055 • Published • 79 -
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
Paper • 2510.04206 • Published • 3 -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper • 2510.05592 • Published • 108 -
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use
Paper • 2602.02160 • Published • 13
-
mats-10-sprint-cs-jb/cot-oracle-eval-atypical-answer-riya
Viewer • Updated • 100 • 57 -
mats-10-sprint-cs-jb/cot-oracle-eval-cybercrime-ood
Viewer • Updated • 100 • 52 -
mats-10-sprint-cs-jb/cot-oracle-eval-forced-answer-entropy-riya
Viewer • Updated • 200 • 36 -
mats-10-sprint-cs-jb/cot-oracle-eval-reasoning-termination-riya
Viewer • Updated • 84 • 53