5 57 4

TongZheng PRO

TongZheng1999

https://kidzheng.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper about 20 hours ago

Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling

upvoted a paper 9 days ago

Model-Adaptive Tool Necessity Reveals the Knowing-Doing Gap in LLM Tool Use

authored a paper 10 days ago

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

View all activity

Organizations

upvoted a paper about 20 hours ago

Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling

Paper • 2605.27030 • Published 2 days ago • 24

upvoted a paper 9 days ago

Model-Adaptive Tool Necessity Reveals the Knowing-Doing Gap in LLM Tool Use

Paper • 2605.14038 • Published 15 days ago • 15

upvoted a paper 13 days ago

EvolveMem:Self-Evolving Memory Architecture via AutoResearch for LLM Agents

Paper • 2605.13941 • Published 15 days ago • 24

upvoted 3 papers 16 days ago

G-Zero: Self-Play for Open-Ended Generation from Zero Data

Paper • 2605.09959 • Published 17 days ago • 17

DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification

Paper • 2605.09269 • Published 18 days ago • 6

Reinforcing Multimodal Reasoning Against Visual Degradation

Paper • 2605.09262 • Published 18 days ago • 7

upvoted a paper 17 days ago

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Paper • 2605.08083 • Published 20 days ago • 68

upvoted 2 papers 20 days ago

On Time, Within Budget: Constraint-Driven Online Resource Allocation for Agentic Workflows

Paper • 2605.06110 • Published 21 days ago • 16

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper • 2605.05566 • Published 21 days ago • 37

upvoted a paper 2 months ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 140

upvoted a paper 3 months ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

upvoted 8 papers 4 months ago

OPE: Overcoming Information Saturation in Parallel Thinking via Outline-Guided Path Exploration

Paper • 2602.08344 • Published Feb 9 • 5

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 76

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published Feb 4 • 80

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32

Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation

Paper • 2602.03619 • Published Feb 3 • 28

upvoted a paper 5 months ago

RelayLLM: Efficient Reasoning via Collaborative Decoding

Paper • 2601.05167 • Published Jan 8 • 31

TongZheng PRO

AI & ML interests

Recent Activity

Organizations

TongZheng1999's activity