wuyuhao

mozhu

AI & ML interests

None yet

Recent Activity

upvoted a paper 24 days ago

Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces

submitted a paper 24 days ago

Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces

upvoted a paper about 1 month ago

δ-mem: Efficient Online Memory for Large Language Models

upvoted a paper 24 days ago

Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces

Paper • 2605.29288 • Published about 1 month ago • 9

upvoted a paper about 1 month ago

δ-mem: Efficient Online Memory for Large Language Models

Paper • 2605.12357 • Published May 12 • 131

upvoted a paper 2 months ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published Apr 13 • 144

upvoted 2 papers 3 months ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published Mar 16 • 187

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 189

upvoted 5 papers 4 months ago

upvoted 2 papers 5 months ago

Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities

Paper • 2601.21937 • Published Jan 29 • 20

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 275

upvoted 2 papers 8 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 133

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 69

upvoted a paper 9 months ago

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Paper • 2509.22638 • Published Sep 26, 2025 • 70

upvoted a paper 11 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 212

upvoted a paper 12 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 257

upvoted 2 papers about 1 year ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23, 2025 • 57

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

Paper • 2506.04180 • Published Jun 4, 2025 • 35

upvoted a paper over 1 year ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 146
