QRQ
RichardQRQ
AI & ML interests
None yet
Recent Activity
upvoted a paper about 3 hours ago
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL
upvoted a paper 7 days ago
Recursive Multi-Agent Systems
Organizations
None yet