view article Article Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device Optimizations 8 days ago • 7
view article Article Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines +2 9 days ago • 33
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference Paper • 2602.21548 • Published 16 days ago • 43
Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents? Paper • 2602.11988 • Published 29 days ago • 4
Think Deep, Not Just Long: Measuring LLM Reasoning Effort via Deep-Thinking Tokens Paper • 2602.13517 • Published 28 days ago • 2
MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents Paper • 2601.03236 • Published Jan 6 • 7
XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQL Paper • 2411.08599 • Published Nov 13, 2024 • 2
XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL Paper • 2507.04701 • Published Jul 7, 2025 • 1
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning Paper • 2504.08600 • Published Apr 11, 2025 • 33
SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications Paper • 2506.18951 • Published Jun 23, 2025 • 22
view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 Feb 4 • 88
view article Article IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST 23 days ago • 18