Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification Paper • 2601.15808 • Published 5 days ago • 14
Ray121381/eveo_anchor_advantage_independent-qwen2.5-7b-sciworld-self-sum-self-gen-maxlen-2048 Updated Dec 23, 2025
Ray121381/eveo_anchor_advantage_independent-qwen2.5-7b-sciworld-self-sum-self-gen-maxlen-2048 Updated Dec 23, 2025
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published Dec 2, 2025 • 53