Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning • Paper • 2605.22642 • Published 3 days ago • 32
An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models • Paper • 2408.00724 • Published Aug 1, 2024 • 2
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision • Paper • 2403.09472 • Published Mar 14, 2024 • 1