nl2sql-copilot / benchmarks /evaluate_spider_pro.py

Commit History

feat(benchmarks): add pro evaluator with EM, structural match, execution accuracy, and safety consistency metrics
ebc7457

Melika Kheirieh commited on