This collection stores a series of models from the project Reinforcement Learning for Reflection Ability of Math Reasoning Model.
SII-BoHuang
AI & ML interests
SII is an institution dedicated to innovation in AI education and research. Bo Huang is part of SII and focuses on LLM agents.
Recent Activity
upvoted a paper 4 days ago
MemQ: Integrating Q-Learning into Self-Evolving Memory Agents over Provenance DAGs
upvoted a paper 16 days ago
MMSkills: Towards Multimodal Skills for General Visual Agents
upvoted a paper about 1 month ago
Hybrid Policy Distillation for LLMs
Organizations
None yet