An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift Paper • 2601.05882 • Published 19 days ago • 20
Scaling Zero-Shot Reference-to-Video Generation Paper • 2512.06905 • Published Dec 7, 2025 • 29
SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space Paper • 2511.20102 • Published Nov 25, 2025 • 27
Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States Paper • 2510.11052 • Published Oct 13, 2025 • 52
CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive Decoding Paper • 2509.23379 • Published Sep 27, 2025 • 15
Fine-Tuning on Noisy Instructions: Effects on Generalization and Performance Paper • 2510.03528 • Published Oct 3, 2025 • 19
When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced Misalignment Paper • 2509.00544 • Published Aug 30, 2025 • 11
IntrEx: A Dataset for Modeling Engagement in Educational Conversations Paper • 2509.06652 • Published Sep 8, 2025 • 26
Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation? Paper • 2508.19827 • Published Aug 27, 2025 • 33
SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers Paper • 2504.00255 • Published Mar 31, 2025 • 1
Spectrum Projection Score: Aligning Retrieved Summaries with Reader Models in Retrieval-Augmented Generation Paper • 2508.05909 • Published Aug 8, 2025 • 21
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems Paper • 2508.07407 • Published Aug 10, 2025 • 98
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B Text Generation • 8B • Updated Feb 24, 2025 • 600k • • 777