Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning Paper • 2606.24133 • Published 2 days ago • 3
ReMMD: Realistic Multilingual Multi-Image Agentic Verification for Multimodal Misinformation Detection Paper • 2606.24112 • Published 2 days ago • 2
ReMMD: Realistic Multilingual Multi-Image Agentic Verification for Multimodal Misinformation Detection Paper • 2606.24112 • Published 2 days ago • 2
Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning Paper • 2606.24133 • Published 2 days ago • 3
AC-ODM: Actor--Critic Online Data Mixing for Sample-Efficient LLM Pretraining Paper • 2505.23878 • Published 11 days ago • 1
AC-ODM: Actor--Critic Online Data Mixing for Sample-Efficient LLM Pretraining Paper • 2505.23878 • Published 11 days ago • 1
AC-ODM: Actor--Critic Online Data Mixing for Sample-Efficient LLM Pretraining Paper • 2505.23878 • Published 11 days ago • 1
Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation Paper • 2602.01756 • Published Feb 2 • 23