Instructions to use lakshitha722/querymind-nl2sql with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Local Apps Settings
- Unsloth Studio
How to use lakshitha722/querymind-nl2sql with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for lakshitha722/querymind-nl2sql to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for lakshitha722/querymind-nl2sql to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for lakshitha722/querymind-nl2sql to start chatting
Load model with FastModel
pip install unsloth from unsloth import FastModel model, tokenizer = FastModel.from_pretrained( model_name="lakshitha722/querymind-nl2sql", max_seq_length=2048, )
File size: 2,996 Bytes
fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 fed27a9 15e7520 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 | ---
license: apache-2.0
base_model: unsloth/Llama-3.2-3B-Instruct
tags:
- text-to-sql
- nl2sql
- unsloth
- llama
- lora
- qlora
datasets:
- spider
metrics:
- exact_match
- similarity
model-index:
- name: querymind-nl2sql
results: []
---
# 🧠 QueryMind: Natural Language to SQL Engine
QueryMind is a domain-specific, highly-optimized **NL-to-SQL engine** powered by a fine-tuned **LLaMA 3.2 3B Instruct** model. It has been fine-tuned using **QLoRA (4-bit)** via **Unsloth** on the **Spider NL2SQL dataset** to translate plain English queries into accurate, schema-valid SQL statements based on a provided database schema.
---
## 🎯 Model Details
- **Developed by:** Lakshitha Nuwan
- **Model type:** Causal Language Model (Fine-tuned LLM)
- **Language(s) (NLP):** English
- **License:** Apache 2.0
- **Finetuned from model:** [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct)
- **Training Framework:** Unsloth & PyTorch
---
## 🔗 Model Sources
- **HuggingFace Repository:** [lakshitha722/querymind-nl2sql](https://huggingface.co/lakshitha722/querymind-nl2sql)
- **Interactive Live Demo:** [HuggingFace Space Demo](https://huggingface.co/spaces/lakshitha722/querymind-nl2sql-demo)
- **Experiment Tracking:** [Weights & Biases (W&B) Dashboard](https://wandb.ai/lakshithanuwan722-other/querymind-nl2sql)
---
## 💻 How to Get Started with the Model
Use the code below to load the model and generate SQL queries using **Unsloth** (recommended for local GPUs) or standard HuggingFace **Transformers**.
### Inference with Unsloth (Recommended)
```python
from unsloth import FastLanguageModel
import torch
MODEL_NAME = "lakshitha722/querymind-nl2sql"
model, tokenizer = FastLanguageModel.from_pretrained(
model_name = MODEL_NAME,
max_seq_length = 1024,
load_in_4bit = True,
dtype = None,
)
FastLanguageModel.for_inference(model)
# 1. Define Prompt Template
PROMPT_TEMPLATE = """Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction:
Convert the following natural language question to a SQL query based on the given database schema. Return ONLY the SQL query, nothing else.
### Schema:
{schema}
### Question:
{question}
### Response:
"""
# 2. Prepare Inputs
schema = "Database: company\nTables: employees (id, name, department, salary, hire_date)"
question = "What is the average salary by department?"
prompt = PROMPT_TEMPLATE.format(schema=schema, question=question)
inputs = tokenizer([prompt], return_tensors="pt").to("cuda")
# 3. Generate
with torch.no_grad():
outputs = model.generate(
**inputs,
max_new_tokens = 150,
temperature = 0.1,
do_sample = False,
pad_token_id = tokenizer.eos_token_id,
)
# 4. Decode Output
input_length = inputs['input_ids'].shape[1]
sql = tokenizer.decode(outputs[0][input_length:], skip_special_tokens=True).strip()
print("Generated SQL:", sql) |