YijuGuo

YijuGuo

openbmb

·

https://yijuguo.github.io/

AI & ML interests

LLM Alignment

Recent Activity

upvoted a paper 20 days ago

Redesign Mixture-of-Experts Routers with Manifold Power Iteration

upvoted a paper 26 days ago

Rethinking Continual Experience Internalization for Self-Evolving LLM Agents

liked a Space 2 months ago

duoan/TorchCode

View all activity

Organizations

upvoted a paper 20 days ago

Redesign Mixture-of-Experts Routers with Manifold Power Iteration

Paper • 2606.12397 • Published 21 days ago • 89

upvoted a paper 26 days ago

Rethinking Continual Experience Internalization for Self-Evolving LLM Agents

Paper • 2606.04703 • Published 28 days ago • 25

upvoted a paper 3 months ago

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

Paper • 2603.14465 • Published Mar 15 • 23

upvoted 4 papers 5 months ago

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Paper • 2602.12125 • Published Feb 12 • 68

AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research

Paper • 2602.06540 • Published Feb 6 • 22

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Paper • 2601.13761 • Published Jan 20 • 16

Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification

Paper • 2601.21244 • Published Jan 29 • 12