Avoiding Premature Collapse: Adaptive Annealing for Entropy-Regularized Structural Inference Paper • 2601.23039 • Published 11 days ago • 1 • 3
Exploring Knowledge Purification in Multi-Teacher Knowledge Distillation for LLMs Paper • 2602.01064 • Published 9 days ago • 1 • 3
Seg-ReSearch: Segmentation with Interleaved Reasoning and External Search Paper • 2602.04454 • Published 6 days ago • 1 • 3
SEAD: Self-Evolving Agent for Multi-Turn Service Dialogue Paper • 2602.03548 • Published 7 days ago • 3 • 3
QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals Paper • 2602.02581 • Published 10 days ago • 6 • 3
ReMiT: RL-Guided Mid-Training for Iterative LLM Evolution Paper • 2602.03075 • Published 7 days ago • 4 • 3
OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention Paper • 2602.05847 • Published 5 days ago • 11 • 3
Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing Paper • 2602.04837 • Published 6 days ago • 7 • 3
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale Paper • 2602.05711 • Published 5 days ago • 8 • 3
RaBiT: Residual-Aware Binarization Training for Accurate and Efficient LLMs Paper • 2602.05367 • Published 5 days ago • 7 • 3
Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities Paper • 2602.05281 • Published 5 days ago • 14 • 2
Pisets: A Robust Speech Recognition System for Lectures and Interviews Paper • 2601.18415 • Published 15 days ago • 31 • 3
Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training Paper • 2602.05940 • Published 5 days ago • 16 • 3
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 8 days ago • 29 • 3
AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders Paper • 2602.05027 • Published 6 days ago • 51 • 3
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models Paper • 2602.03392 • Published 7 days ago • 48 • 4
OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions Paper • 2602.05843 • Published 5 days ago • 53 • 3
You Need an Encoder for Native Position-Independent Caching Paper • 2602.01519 • Published 8 days ago • 3
Adaptive Evidence Weighting for Audio-Spatiotemporal Fusion Paper • 2602.03817 • Published 7 days ago • 3
RecGOAT: Graph Optimal Adaptive Transport for LLM-Enhanced Multimodal Recommendation with Dual Semantic Alignment Paper • 2602.00682 • Published 10 days ago • 1 • 3