Joseph

lkdsmr

9

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

TRIAGE: Role-Typed Credit Assignment for Agentic Reinforcement Learning

upvoted a paper about 2 months ago

Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training

updated a collection 3 months ago

View all activity

Organizations

None yet

upvoted a paper 1 day ago

TRIAGE: Role-Typed Credit Assignment for Agentic Reinforcement Learning

Paper • 2606.32017 • Published 3 days ago • 7

upvoted a paper about 2 months ago

Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training

Paper • 2605.12483 • Published May 12 • 10

updated a collection 3 months ago

papers

4 items • Updated Apr 16

upvoted a paper 3 months ago

TIP: Token Importance in On-Policy Distillation

Paper • 2604.14084 • Published Apr 15 • 15

updated a collection 4 months ago

papers

4 items • Updated Apr 16

upvoted 4 papers 4 months ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published Mar 12 • 60

On-Policy Self-Distillation for Reasoning Compression

Paper • 2603.05433 • Published Mar 5 • 9

Not all tokens are needed(NAT): token efficient reinforcement learning

Paper • 2603.06619 • Published Feb 20 • 1

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

Paper • 2602.21420 • Published Feb 24 • 6

updated a collection 4 months ago

papers

4 items • Updated Apr 16

upvoted 2 papers 4 months ago

PACED: Distillation at the Frontier of Student Competence

Paper • 2603.11178 • Published Mar 11 • 4

Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Paper • 2603.12255 • Published Mar 12 • 91