goodhart-gap-benchmark / data / goodhart_contested.jsonl

Commit History

v2.0: Combined with cgrt-consensus-5model data (8,050 disagreements, 1,556 contested)
ca5e3d7
verified

Adam1010 committed on