daVinci-Dev: Agent-native Mid-training for Software Engineering Paper • 2601.18418 • Published 1 day ago • 103 • 2
Diffusion In Diffusion: Reclaiming Global Coherence in Semi-Autoregressive Diffusion Paper • 2601.13599 • Published 8 days ago • 3 • 2
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs Paper • 2601.17058 • Published 5 days ago • 125 • 2
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness Paper • 2205.14135 • Published May 27, 2022 • 15 • 4
Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain Paper • 2601.16018 • Published 5 days ago • 7 • 3
Towards Automated Kernel Generation in the Era of LLMs Paper • 2601.15727 • Published 5 days ago • 16 • 3
VideoMaMa: Mask-Guided Video Matting via Generative Prior Paper • 2601.14255 • Published 7 days ago • 13 • 3
Numba-Accelerated 2D Diffusion-Limited Aggregation: Implementation and Fractal Characterization Paper • 2601.15440 • Published 6 days ago • 1 • 3
ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion Paper • 2601.16148 • Published 5 days ago • 12 • 4
Nested Learning: The Illusion of Deep Learning Architectures Paper • 2512.24695 • Published 27 days ago • 40 • 6
LLM-in-Sandbox Elicits General Agentic Intelligence Paper • 2601.16206 • Published 5 days ago • 73 • 4
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding Paper • 2601.14724 • Published 6 days ago • 71 • 4
BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries Paper • 2601.15197 • Published 6 days ago • 54 • 4
Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization Paper • 2601.12993 • Published 8 days ago • 74 • 3
Very Deep Convolutional Networks for Large-Scale Image Recognition Paper • 1409.1556 • Published Sep 4, 2014 • 2 • 1
Toward Efficient Agents: Memory, Tool learning, and Planning Paper • 2601.14192 • Published 7 days ago • 49 • 3