view article Article Where should test-time compute go? Surprisal-guided selection in verifiable environments Feb 7 • 1