nl2sql-copilot / benchmarks

Commit History

fix(ignore): whitelist data/demo.db in git and docker ignore rules
b432020

Melika Kheirieh commited on

fix(ui): remove all mock/Spider fallbacks and route queries to real backend only
cc371b0

Melika Kheirieh commited on

docs(readme): revamp and polish README for production showcase
8f50117

Melika Kheirieh commited on

feat(core): stabilize benchmark pipeline with accurate latency tracking, retry-empty handling, and refined plots
bf06cf7

Melika Kheirieh commited on

fix(core): non-zero generator timing + one-shot EMPTY retry; post-verify drop LIMIT to recover EM when ExecAcc=1
3b2af0f

Melika Kheirieh commited on

perf(planner): trim relevant tables (+cache) to cut latency; keep repair loop & rich traces
8b2d603

Melika Kheirieh commited on

refactor(core): trace schema upgrade, verifier/executor sync, benchmark plot polish
e3e0ac5

Melika Kheirieh commited on

feat(trace): enrich StageTrace (sql_length/row_count/verified/error_type/repair_attempts/skipped) and propagate in normalization; tag EmptySQL; annotate repair attempts
3716701

Melika Kheirieh commited on

feat(bench): gold-aware EM/SM/ExecAcc + p50/p95; write per-stage means; richer plots
296a94d

Melika Kheirieh commited on

feat(core): refine pipeline & verifier; improve Spider benchmark accuracy
b794494

Melika Kheirieh commited on

feat(bench): auto-detect latest run and plot per-stage latency + metrics summary
db1d448

Melika Kheirieh commited on

chore(factory): safely load .env via dotenv (with fallback under CI)
b21cd69

Melika Kheirieh commited on

chore(factory): safely load .env via dotenv (with fallback under CI)
f8224ec

Melika Kheirieh commited on

fix(grafana): move nl2sql.json into provisioning folder and fix dashboard mount path
454d146

Melika Kheirieh commited on

feat(benchmarks): add pro evaluator with EM, structural match, execution accuracy, and safety consistency metrics
ebc7457

Melika Kheirieh commited on

feat(benchmarks): align Spider eval with config-driven Pipeline and native Safety; log per-stage trace; add CSV summary
ed681b1

Melika Kheirieh commited on

feat(benchmarks): align Spider eval with config-driven Pipeline and native Safety; log per-stage trace; add CSV summary
598536c

Melika Kheirieh commited on

style: format code with ruff
dcc30f0

Melika Kheirieh commited on

fix(types): resolve mypy errors and make pytest pass
eee3f75

Melika Kheirieh commited on

build(mypy): fix type errors and add safety guards for None values
a337fad

Melika Kheirieh commited on

Fix some typo
713d3ca

Melika Kheirieh commited on

style: format code with ruff
105e019

Melika Kheirieh commited on

style: format code with ruff
c1bc4eb

Melika Kheirieh commited on

init: NL2SQL Copilot base with API and Dockerfile
570f7bd

Melika Kheirieh commited on

Add more advanced metrics
5eeca35

Melika Kheirieh commited on

Add first benchmark
e207f41

Melika Kheirieh commited on