ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning Paper • 2603.10160 • Published 3 days ago • 20
Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning Paper • 2603.10377 • Published 2 days ago • 3
UniCom: Unified Multimodal Modeling via Compressed Continuous Semantic Representations Paper • 2603.10702 • Published 2 days ago • 3
LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published 2 days ago • 23
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published 3 days ago • 57
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data Paper • 2603.09206 • Published 3 days ago • 41
Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion Paper • 2603.06577 • Published 7 days ago • 43
Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity Paper • 2603.05168 • Published 8 days ago • 4
Scaling Agentic Capabilities, Not Context: Efficient Reinforcement Finetuning for Large Toolspaces Paper • 2603.06713 • Published 8 days ago • 15
Lost in Stories: Consistency Bugs in Long Story Generation by LLMs Paper • 2603.05890 • Published 7 days ago • 81
AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery Paper • 2603.07300 • Published 6 days ago • 14
Believe Your Model: Distribution-Guided Confidence Calibration Paper • 2603.03872 • Published 9 days ago • 36
Dynamic Model Routing and Cascading for Efficient LLM Inference: A Survey Paper • 2603.04445 • Published 18 days ago • 4