Less is More: Early Stopping Rollout for On-Policy Distillation Paper • 2605.27028 • Published May 26 • 15
DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes Paper • 2605.28421 • Published May 27 • 48
Neglected Free Lunch from Post-training: Progress Advantage for LLM Agents Paper • 2606.26080 • Published 9 days ago • 10
Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings Paper • 2606.07502 • Published 28 days ago • 99
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 187 items • Updated about 22 hours ago • 45
Imagination Helps Visual Reasoning, But Not Yet in Latent Space Paper • 2602.22766 • Published Feb 26 • 44
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning Paper • 2602.12099 • Published Feb 12 • 62
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models Paper • 2602.03392 • Published Feb 3 • 59
Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention Paper • 2602.01801 • Published Feb 2 • 28
Towards Pixel-Level VLM Perception via Simple Points Prediction Paper • 2601.19228 • Published Jan 27 • 19
Linear representations in language models can change dramatically over a conversation Paper • 2601.20834 • Published Jan 28 • 21
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting Paper • 2601.02151 • Published Jan 5 • 115
I-Con: A Unifying Framework for Representation Learning Paper • 2504.16929 • Published Apr 23, 2025 • 31
Pretraining Frame Preservation in Autoregressive Video Memory Compression Paper • 2512.23851 • Published Dec 29, 2025 • 25
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process Paper • 2512.23988 • Published Dec 30, 2025 • 19
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published Dec 18, 2025 • 91