TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization Paper • 2601.16480 • Published 4 days ago • 50
MULTI: Multimodal Understanding Leaderboard with Text and Images Paper • 2402.03173 • Published Feb 5, 2024 • 3
Case2Code: Learning Inductive Reasoning with Synthetic Data Paper • 2407.12504 • Published Jul 17, 2024 • 8
FastMCTS: A Simple Sampling Strategy for Data Synthesis Paper • 2502.11476 • Published Feb 17, 2025 • 1
UnitCoder: Scalable Iterative Code Synthesis with Unit Test Guidance Paper • 2502.11460 • Published Feb 17, 2025
InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling Paper • 2508.08636 • Published Aug 12, 2025 • 2
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 262
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 262
Pre-Trained Policy Discriminators are General Reward Models Paper • 2507.05197 • Published Jul 7, 2025 • 39
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14, 2025 • 306