Strategic Scaling of Test-Time Compute: A Bandit Learning Approach • Paper 2506.12721 • Published Jun 15, 2025