Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation Paper • 2504.18857 • Published Apr 26, 2025
Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations Paper • 2503.14895 • Published Mar 19, 2025
The Role of Entropy in Visual Grounding: Analysis and Optimization Paper • 2512.06726 • Published Dec 7, 2025 • 1
Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment Paper • 2601.14249 • Published 6 days ago • 8
Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment Paper • 2601.14249 • Published 6 days ago • 8
Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment Paper • 2601.14249 • Published 6 days ago • 8
OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment Paper • 2601.01576 • Published 22 days ago • 18
Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels Paper • 2509.16596 • Published Sep 20, 2025 • 14
Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels Paper • 2509.16596 • Published Sep 20, 2025 • 14
Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels Paper • 2509.16596 • Published Sep 20, 2025 • 14 • 2
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 262
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination Paper • 2507.10532 • Published Jul 14, 2025 • 90
Pre-Trained Policy Discriminators are General Reward Models Paper • 2507.05197 • Published Jul 7, 2025 • 39
Pre-Trained Policy Discriminators are General Reward Models Paper • 2507.05197 • Published Jul 7, 2025 • 39