li sheng

bambisheng

https://github.com/BambiSheng

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 5 days ago • 136

upvoted a collection 4 days ago

Qwen-AgentWorld

Collection

3 items • Updated 4 days ago • 53

upvoted 2 papers 4 days ago

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

Paper • 2606.24530 • Published 5 days ago • 61

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 5 days ago • 136

upvoted a paper 26 days ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Paper • 2605.29343 • Published May 28 • 36

New activity in TsinghuaC3I/ZEDA-Evaluation about 1 month ago

Add dataset card and link to paper/code

👍 1

#1 opened about 1 month ago by nielsr

upvoted a collection about 1 month ago

ZEDA

Collection

4 items • Updated May 19 • 3

authored a paper about 1 month ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published May 18 • 30

upvoted a paper about 1 month ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published May 18 • 30

updated a dataset about 1 month ago

TsinghuaC3I/ZEDA-Evaluation

Preview • Updated May 21 • 153

published a dataset about 1 month ago

TsinghuaC3I/ZEDA-Evaluation

Preview • Updated May 21 • 153

upvoted a paper 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 113

upvoted a paper 3 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 110

updated a dataset 3 months ago

dynn-datasets/Evaluation

Preview • Updated Mar 24 • 61

published a dataset 3 months ago

dynn-datasets/Evaluation

Preview • Updated Mar 24 • 61

upvoted a paper 4 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 60

upvoted a paper 8 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 233

upvoted 2 papers 10 months ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28, 2025 • 120

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14, 2025 • 97

upvoted a paper about 1 year ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 132
