SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors • Paper • 2510.17516 • Published Oct 20