gazet/dataset/scripts/export_training_data.py

Commit History

Randomize candidate dataset order
582d1ab

srmsoumya committed on

fix: reduce templates to country, region & county for mvp
3ba9557

srmsoumya committed on

Lowercase subtype names, fix question type hints
a5fd57d

srmsoumya committed on

Fix: No pairs are created for mixed queries
dfb9466

srmsoumya committed on

enh: Add templates to handle queries like subregion, region
ca28f70

srmsoumya committed on

prompt: Add notes on how to pick places from query better
d237392

srmsoumya committed on

fix: mixed templates are not generated because of stale parquet files
8364f3c

srmsoumya committed on

chore: clean sql generation, use conversation format, move prompts from user to system
c77ca5f

srmsoumya committed on

Add finetune instructions for gemma
1104031

srmsoumya committed on

FEAT: Add to create SLM training data
d8d4856

srmsoumya committed on