LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories Paper • 2606.13578 • Published 16 days ago • 54
When Should Models Change Their Minds? Contextual Belief Management in Large Language Models Paper • 2605.30219 • Published 30 days ago • 26
Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling Paper • 2604.27039 • Published Apr 29 • 26
Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis Paper • 2604.24198 • Published Apr 27 • 23
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language Paper • 2604.19667 • Published Apr 21 • 23
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published Apr 4 • 38
AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders Paper • 2602.05027 • Published Feb 4 • 63
How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities Paper • 2603.02578 • Published Mar 3 • 25
view article Article Making LLMs Truly Remember You | LightMem: Lightweight and Efficient Memory-Augmented Generation xzwnlp • Feb 28 • 5
view article Article Create, Evaluate, and Connect AI Skills | SkillNet: A Large-Scale Agentic "Skill Graph" Knowledge Base xzwnlp • Feb 28 • 13
InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem Paper • 2602.14367 • Published Feb 16 • 17