Commit
·
c9d6c5c
1
Parent(s):
1b0e109
Update README.md
Browse files
README.md
CHANGED
|
@@ -19,6 +19,21 @@ license: bigcode-openrail-m
|
|
| 19 |
- Finetuning was performed using QLoRa on a single RTX6000 48GB.
|
| 20 |
- If you want the merged model: [richardr1126/spider-natsql-wizard-coder-merged](https://huggingface.co/richardr1126/spider-natsql-wizard-coder-merged)
|
| 21 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 22 |
## Citation
|
| 23 |
|
| 24 |
Please cite the repo if you use the data or code in this repo.
|
|
|
|
| 19 |
- Finetuning was performed using QLoRa on a single RTX6000 48GB.
|
| 20 |
- If you want the merged model: [richardr1126/spider-natsql-wizard-coder-merged](https://huggingface.co/richardr1126/spider-natsql-wizard-coder-merged)
|
| 21 |
|
| 22 |
+
### Spider Dataset
|
| 23 |
+
|
| 24 |
+
[Spider](https://arxiv.org/abs/1809.08887) is a large-scale complex and cross-domain semantic parsing and text-to-SQL dataset annotated by 11 Yale students
|
| 25 |
+
The goal of the Spider challenge is to develop natural language interfaces to cross-domain databases.
|
| 26 |
+
|
| 27 |
+
This dataset was used to finetune this model.
|
| 28 |
+
|
| 29 |
+
### NatSQL
|
| 30 |
+
|
| 31 |
+
[NatSQL](https://arxiv.org/abs/2109.05153) is an intermediate representation for SQL that simplifies the queries and reduces the mismatch between
|
| 32 |
+
natural language and SQL. NatSQL preserves the core functionalities of SQL, but removes some clauses and keywords
|
| 33 |
+
that are hard to infer from natural language descriptions. NatSQL also makes schema linking easier by reducing the
|
| 34 |
+
number of schema items to predict. NatSQL can be easily converted to executable SQL queries and can improve the
|
| 35 |
+
performance of text-to-SQL models.
|
| 36 |
+
|
| 37 |
## Citation
|
| 38 |
|
| 39 |
Please cite the repo if you use the data or code in this repo.
|