hopix
hopix30456
AI & ML interests
None yet
Recent Activity
upvoted a paper about 17 hours ago
CEPO: RLVR Self-Distillation using Contrastive Evidence Policy Optimization upvoted a paper 7 months ago
Dr.LLM: Dynamic Layer Routing in LLMs upvoted a paper 11 months ago
AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT
and RL SynergyOrganizations
None yet