IntelliAsk: Learning to Ask High-Quality Research Questions via RLVR Paper • 2602.15849 • Published Jan 23 • 3
Agent READMEs: An Empirical Study of Context Files for Agentic Coding Paper • 2511.12884 • Published Nov 17, 2025 • 28 • 6
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 151
view article Article Reproducing NanoBanana 2's "Window Seat" with AI-Scientist-V3 alexshengzhili • Mar 1 • 1
view article Article Reproducing NanoBanana 2's "Window Seat" with AI-Scientist-V3 alexshengzhili • Mar 1 • 1
view article Article AI Scientist v3: Agent Native refactor. Scale from 1-hour to 24 hours with Reviewer agent alexshengzhili • Mar 1 • 3
view article Article AI Scientist v3: Agent Native refactor. Scale from 1-hour to 24 hours with Reviewer agent alexshengzhili • Mar 1 • 3
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy Paper • 2507.01352 • Published Jul 2, 2025 • 60 • 7
alexshengzhili/generalreasoning-stage2-combined-filtered-kept Viewer • Updated Apr 29, 2025 • 25.4k • 10
alexshengzhili/generalreasoning-stage2-combined-filtered-kept Viewer • Updated Apr 29, 2025 • 25.4k • 10