QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals Paper • 2602.02581 • Published 10 days ago • 6
CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty Paper • 2601.22027 • Published 11 days ago • 78
PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR Paper • 2601.18207 • Published 15 days ago • 19
Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR Paper • 2602.05261 • Published 5 days ago • 47
On the Limits of Layer Pruning for Generative Reasoning in LLMs Paper • 2602.01997 • Published 8 days ago • 4
Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry Paper • 2601.22588 • Published 11 days ago • 5
VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration Paper • 2601.22674 • Published 11 days ago • 5
RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents Paper • 2602.02486 • Published 7 days ago • 17
LRAgent: Efficient KV Cache Sharing for Multi-LoRA LLM Agents Paper • 2602.01053 • Published 9 days ago • 8
Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection Paper • 2602.03216 • Published 7 days ago • 12
MARS: Modular Agent with Reflective Search for Automated AI Research Paper • 2602.02660 • Published 7 days ago • 60
From Data to Behavior: Predicting Unintended Model Behaviors Before Training Paper • 2602.04735 • Published 5 days ago • 14
Horizon-LM: A RAM-Centric Architecture for LLM Training Paper • 2602.04816 • Published 5 days ago • 16
VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning Paper • 2601.22069 • Published 11 days ago • 7
Latent Adversarial Regularization for Offline Preference Optimization Paper • 2601.22083 • Published 11 days ago • 13
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents Paper • 2601.12346 • Published 23 days ago • 49
MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models Paper • 2601.11969 • Published 24 days ago • 26