FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning Paper • 2601.11141 • Published 11 days ago • 20
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs Paper • 2601.13836 • Published 7 days ago • 34
The Responsibility Vacuum: Organizational Failure in Scaled Agent Systems Paper • 2601.15059 • Published 6 days ago • 3
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models Paper • 2601.07372 • Published 15 days ago • 37
What Matters in Data Curation for Multimodal Reasoning? Insights from the DCVLR Challenge Paper • 2601.10922 • Published 12 days ago • 3
Good agents related space, model, dataset Collection Good agents related space, model, dataset collection • 29 items • Updated 5 days ago • 1
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts Paper • 2601.11044 • Published 11 days ago • 34
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI Paper • 2406.12753 • Published Jun 18, 2024 • 17
ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction Paper • 2511.20937 • Published Nov 26, 2025 • 16
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation Paper • 2601.03782 • Published 20 days ago • 1
view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 22 days ago • 37
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 203