Yucheng Wang's picture

🤝 Open to Collab

4 1

Yucheng Wang

Echoandland

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

upvoted a paper 4 months ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

authored a paper 5 months ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

View all activity

Organizations

upvoted a paper 16 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 17 days ago • 142

upvoted a paper 4 months ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 150

authored a paper 5 months ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published Jan 14 • 92

upvoted a paper 5 months ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published Jan 14 • 92

updated 2 models 6 months ago

Echoandland/olmo3-7b-physics-grpo-purerl-step9

Reinforcement Learning • 7B • Updated Dec 26, 2025 • 4

Echoandland/olmo3-7b-physics-grpo-purerl-step7

Reinforcement Learning • 7B • Updated Dec 26, 2025 • 6

published 2 models 6 months ago

Echoandland/olmo3-7b-physics-grpo-purerl-step7

Reinforcement Learning • 7B • Updated Dec 26, 2025 • 6

Echoandland/olmo3-7b-physics-grpo-purerl-step9

Reinforcement Learning • 7B • Updated Dec 26, 2025 • 4

updated a model 6 months ago

Echoandland/qwen3-8b-dapo-high-entropy-step2

Reinforcement Learning • 8B • Updated Dec 24, 2025 • 4

published a model 6 months ago

Echoandland/qwen3-8b-dapo-high-entropy-step2

Reinforcement Learning • 8B • Updated Dec 24, 2025 • 4

updated a model 6 months ago

Echoandland/qwen3-8b-dapo-high-entropy-step8

Reinforcement Learning • 8B • Updated Dec 24, 2025 • 3

published a model 6 months ago

Echoandland/qwen3-8b-dapo-high-entropy-step8

Reinforcement Learning • 8B • Updated Dec 24, 2025 • 3

updated a model 6 months ago

Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step6

Reinforcement Learning • 7B • Updated Dec 23, 2025 • 3

published a model 6 months ago

Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step6

Reinforcement Learning • 7B • Updated Dec 23, 2025 • 3

updated a model 6 months ago

Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step7

Reinforcement Learning • 7B • Updated Dec 23, 2025 • 3

published a model 6 months ago

Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step7

Reinforcement Learning • 7B • Updated Dec 23, 2025 • 3

updated a model 6 months ago

Echoandland/olmo3-7b-grpo-purerl-creativity-step28

Reinforcement Learning • 7B • Updated Dec 23, 2025 • 4

published a model 6 months ago

Echoandland/olmo3-7b-grpo-purerl-creativity-step28

Reinforcement Learning • 7B • Updated Dec 23, 2025 • 4

updated a model 6 months ago

Echoandland/olmo3-7b-grpo-purerl-creativity-step5

Reinforcement Learning • 7B • Updated Dec 23, 2025 • 2

published a model 6 months ago

Echoandland/olmo3-7b-grpo-purerl-creativity-step5

Reinforcement Learning • 7B • Updated Dec 23, 2025 • 2