FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness • Paper • 2205.14135 • Published May 27, 2022
Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain • Paper • 2601.16018 • Published 4 days ago
Towards Automated Kernel Generation in the Era of LLMs • Paper • 2601.15727 • Published 5 days ago
VideoMaMa: Mask-Guided Video Matting via Generative Prior • Paper • 2601.14255 • Published 6 days ago
Numba-Accelerated 2D Diffusion-Limited Aggregation: Implementation and Fractal Characterization • Paper • 2601.15440 • Published 5 days ago
ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion • Paper • 2601.16148 • Published 4 days ago
Nested Learning: The Illusion of Deep Learning Architectures • Paper • 2512.24695 • Published 27 days ago
LLM-in-Sandbox Elicits General Agentic Intelligence • Paper • 2601.16206 • Published 4 days ago
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding • Paper • 2601.14724 • Published 6 days ago
BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries • Paper • 2601.15197 • Published 5 days ago
Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization • Paper • 2601.12993 • Published 7 days ago
Very Deep Convolutional Networks for Large-Scale Image Recognition • Paper • 1409.1556 • Published Sep 4, 2014
Toward Efficient Agents: Memory, Tool learning, and Planning • Paper • 2601.14192 • Published 6 days ago
Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey • Paper • 2601.11655 • Published 11 days ago
Gemma: Open Models Based on Gemini Research and Technology • Paper • 2403.08295 • Published Mar 13, 2024