DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning • Paper • 2602.11089 • Published 1 day ago • 15
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization • Paper • 2601.16480 • Published 21 days ago • 50
MIG Datasets • Collection • Processed data pools and sampled datasets released with MIG. • 13 items • Updated Apr 27, 2025 • 2
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space • Paper • 2504.13835 • Published Apr 18, 2025 • 38