8 12 13

Yichen You

youyc22

youyc22

AI & ML interests

None yet

Recent Activity

published a dataset 1 day ago

TaH-plus/amteam-8b-121k

upvoted a paper 2 days ago

Qwen-AgentWorld: Language World Models for General Agents

updated a dataset 15 days ago

TaH-plus/eval_science_code

View all activity

Organizations

upvoted a paper 2 days ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 5 days ago • 132

upvoted a paper about 1 month ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published May 18 • 30

upvoted a paper 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 113

upvoted a paper 4 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 60

upvoted an article 7 months ago

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 411

upvoted a collection 7 months ago

TaH

Collection

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models • 9 items • Updated Apr 12 • 2

upvoted 2 papers 7 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 110

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 135

upvoted 2 papers 9 months ago

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3, 2025 • 99

Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification

Paper • 2509.15591 • Published Sep 19, 2025 • 46

upvoted 2 papers about 1 year ago

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

Paper • 2506.16054 • Published Jun 19, 2025 • 60

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Paper • 2505.21600 • Published May 27, 2025 • 71

Yichen You

AI & ML interests

Recent Activity

Organizations

youyc22's activity

Continuous batching from first principles