Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning Paper • 2606.24133 • Published 2 days ago • 3
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 2 days ago • 86
Genie Envisioner Collection Collection for AgiBot Genie Envisioner • 2 items • Updated about 14 hours ago • 1
Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning Paper • 2606.18831 • Published 8 days ago • 4
Article Jawbreaker: Private Scam Defense for Someone You Love build-small-hackathon • 12 days ago • 4
huihui-ai/Huihui-Qwen3-VL-8B-Instruct-abliterated-FP8 Image-Text-to-Text • 8B • Updated about 23 hours ago • 4