4 1

Jingcheng Liang

leoleung04

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

upvoted a paper about 2 months ago

Abstain-R1: Calibrated Abstention and Post-Refusal Clarification via Verifiable RL

upvoted a paper 2 months ago

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 2 days ago • 43

upvoted a paper about 2 months ago

Abstain-R1: Calibrated Abstention and Post-Refusal Clarification via Verifiable RL

Paper • 2604.17073 • Published Apr 18 • 9

upvoted a paper 2 months ago

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

Paper • 2604.08865 • Published Apr 10 • 29

liked a dataset 2 months ago

lime-nlp/OS-Blind

Updated Apr 14 • 19 • 6

upvoted a paper 2 months ago

The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents

Paper • 2604.10577 • Published Apr 12 • 26

updated a collection 4 months ago

Abstain-R1

Collection

1 item • Updated Apr 17

updated a collection 5 months ago

Abstain-R1

Collection

1 item • Updated Apr 17

updated a collection 6 months ago

Abstain-R1

Collection

1 item • Updated Apr 17

updated a model 6 months ago

leoleung04/Abstain-R1

3B • Updated Jan 3 • 24

published a model 6 months ago

leoleung04/Abstain-R1

3B • Updated Jan 3 • 24

Jingcheng Liang

AI & ML interests

Recent Activity

Organizations

leoleung04's activity