Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts Paper • 2601.17111 • Published 4 days ago • 4
Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms Paper • 2510.13913 • Published Oct 15, 2025 • 4
Teaching with Lies: Curriculum DPO on Synthetic Negatives for Hallucination Detection Paper • 2505.17558 • Published May 23, 2025 • 15
MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models Paper • 2502.14302 • Published Feb 20, 2025 • 9