lucas Zhai
lucaszgc
AI & ML interests
None yet
Recent Activity
upvoted a paper about 4 hours ago
OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning
upvoted a paper about 1 month ago
Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles
upvoted a paper about 1 month ago
Self-Distilled Agentic Reinforcement Learning
Organizations
None yet