shawnxzhu

AI & ML interests

None yet

Recent Activity

upvoted a paper 21 days ago

Demystifying Hidden-State Recurrence: Switchable Latent Reasoning with On-Policy Reinforcement Learning

Paper • 2606.13106 • Published 22 days ago • 21

upvoted a paper 28 days ago

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

Paper • 2606.06428 • Published 29 days ago • 25

authored a paper about 1 month ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published May 18 • 50

submitted a paper to Daily Papers about 1 month ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published May 18 • 50

upvoted a paper about 1 month ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published May 18 • 50

upvoted a paper 2 months ago

A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression

Paper • 2604.19572 • Published Apr 21 • 23

upvoted 2 papers 4 months ago

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

Paper • 2510.07896 • Published Oct 9, 2025 • 11

On Data Engineering for Scaling LLM Terminal Capabilities

Paper • 2602.21193 • Published Feb 24 • 103

authored 2 papers 4 months ago

CHARM: Calibrating Reward Models With Chatbot Arena Scores

Paper • 2504.10045 • Published Apr 14, 2025

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

updated a collection 4 months ago

CodeScaler

Collection

5 items • Updated Mar 2 • 6

upvoted a paper 4 months ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

upvoted a collection 4 months ago

CodeScaler

Collection

5 items • Updated Mar 2 • 6

published 3 models 4 months ago

published a dataset 4 months ago

LARK-Lab/CodeScalerPair-51K

Viewer • Updated Feb 23 • 51.1k • 23 • 1

updated a dataset 4 months ago

LARK-Lab/CodeScalerPair-51K

Viewer • Updated Feb 23 • 51.1k • 23 • 1

updated 2 models 4 months ago

LARK-Lab/CodeScaler-8B

Text Classification • 8B • Updated Feb 23 • 71

LARK-Lab/CodeScaler-4B

Text Classification • 4B • Updated Feb 23 • 4
