Jason Tse

jason-tse

AI & ML interests

None yet

Recent Activity

published a dataset 23 days ago

jason-tse/PhyAV-Sound-11K

upvoted a paper 2 months ago

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

upvoted an article 11 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

View all activity

Organizations

None yet

upvoted a paper 2 months ago

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper • 2605.05566 • Published May 7 • 38

upvoted an article 11 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 296