LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 3 days ago • 46
On Time, Within Budget: Constraint-Driven Online Resource Allocation for Agentic Workflows Paper • 2605.06110 • Published 4 days ago • 16
Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration Paper • 2605.05566 • Published 4 days ago • 34
TongZheng1999/Final-Reasoning-4B-Iter1-Strong-Init-Filtered-RB-by-Judge 4B • Updated 30 days ago • 13
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_Merge_f_by_judge Viewer • Updated about 1 month ago • 22.1k • 67
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_filtered_by_judge Viewer • Updated about 1 month ago • 5.43k • 20
TongZheng1999/Final-Reasoning-4B-Iter1-Strong-Init-Filtered-RB 4B • Updated about 1 month ago • 6
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_Merge Viewer • Updated about 1 month ago • 33.4k • 55
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed Viewer • Updated about 1 month ago • 16.7k • 19
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150 Viewer • Updated about 1 month ago • 16.7k • 19