Zhilong Zheng

zzzzl-h

AI & ML interests

None yet

Recent Activity

authored a paper about 9 hours ago

STAPO: Stabilizing Reinforcement Learning for LLMs by Silencing Rare Spurious Tokens

Organizations

None yet

STAPO: Stabilizing Reinforcement Learning for LLMs by Silencing Rare Spurious Tokens

Paper • 2602.15620 • Published 1 day ago • 2