Joseph

lkdsmr

9

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

TRIAGE: Role-Typed Credit Assignment for Agentic Reinforcement Learning

upvoted a paper about 2 months ago

Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training

updated a collection 3 months ago

View all activity

Organizations

None yet

lkdsmr 's models

None public yet