LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published 2 days ago • 27
RoboMME: Benchmarking and Understanding Memory for Robotic Generalist Policies Paper • 2603.04639 • Published 9 days ago • 22
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning Paper • 2603.04918 • Published 8 days ago • 54
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published 7 days ago • 102
Large Multimodal Models as General In-Context Classifiers Paper • 2602.23229 • Published 15 days ago • 22
LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval Paper • 2603.01425 • Published 12 days ago • 5
LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model Paper • 2603.01068 • Published 12 days ago • 20
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published 14 days ago • 85
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 17 days ago • 94
SimVLA: A Simple VLA Baseline for Robotic Manipulation Paper • 2602.18224 • Published 21 days ago • 5
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 261