hazz's picture

2 2

hazz

manakanemu

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

upvoted a paper 5 months ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

liked a dataset almost 2 years ago

mlfoundations/dclm-baseline-1.0

View all activity

Organizations

None yet

upvoted a paper 1 day ago

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Paper • 2605.31584 • Published 5 days ago • 36

upvoted a paper 5 months ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 48

liked 2 datasets almost 2 years ago

mlfoundations/dclm-baseline-1.0

Preview • Updated Jul 22, 2024 • 624k • 280

mlfoundations/DataComp-12M

Preview • Updated Jun 26, 2024 • 1.92k • 12