Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published 2 days ago • 167
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published 8 days ago • 298
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper • 2602.08222 • Published 4 days ago • 245
view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 9 days ago • 64
OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions Paper • 2602.05843 • Published 8 days ago • 57
view article Article Training Design for Text-to-Image Models: Lessons from Ablations 10 days ago • 57
CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty Paper • 2601.22027 • Published 15 days ago • 79
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning Paper • 2602.04634 • Published 9 days ago • 92
AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration Paper • 2602.03786 • Published 10 days ago • 84
No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs Paper • 2602.02103 • Published 11 days ago • 68
Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives Paper • 2601.20833 • Published 16 days ago • 175
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published 15 days ago • 99
view article Article Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek 17 days ago • 43
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs Paper • 2601.17058 • Published 22 days ago • 188