data: ../stateshiftbench_dataset/data/cases
strategy: direct
limit: 5
output: outputs/direct_smoke.jsonl