Open-Sourced model and data for ULTRAIF: Advancing Instruction Following from the Wild.
li sheng
bambisheng
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
How Far Can Unsupervised RLVR Scale LLM Training?
upvoted a paper 4 months ago
Scaling Latent Reasoning via Looped Language Models
upvoted a paper 6 months ago
rStar2-Agent: Agentic Reasoning Technical Report