AnyGroundBench: A Specialized-Domain Benchmark for Video Grounding in Vision-Language Models Paper • 2607.02269 • Published 2 days ago • 7
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding Paper • 2605.29707 • Published May 28 • 152
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs Paper • 2605.30611 • Published May 28 • 250
stefanocarrera/autophagycode_D_he_train-mercury_Qwen3-4B_strategy_surplexity_t1_g5_run1_metrics Viewer • Updated Jun 2 • 164 • 15 • 1
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published May 27 • 431
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Paper • 2605.20025 • Published May 19 • 190
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published Apr 8 • 123