feat(eval): field-level extraction eval harness + tests 1b58e74 Dimitris Codex commited on 24 days ago