|
|
--- |
|
|
base_model: google/gemma-2b |
|
|
library_name: peft |
|
|
license: bsl-1.0 |
|
|
tags: |
|
|
- code |
|
|
datasets: |
|
|
- b-mc2/sql-create-context |
|
|
language: |
|
|
- en |
|
|
pipeline_tag: text-generation
|
|
--- |
|
|
|
|
|
# Model Card for Lei-bw/text-to-sql-fm
|
|
|
|
|
### Model Description |
|
|
|
|
|
This model is google/gemma-2b quantized to 8-bit and fine-tuned on question-answer pairs for text-to-SQL tasks using the LoRA PEFT method. It serves as a foundation model for further development of text-to-SQL Retrieval-Augmented Generation (RAG) applications.
|
|
|
|
|
- **Developed by:** Lei-bw |
|
|
- **Model type:** Causal Language Model |
|
|
- **Language(s) (NLP):** English |
|
|
- **License:** bsl-1.0 |
|
|
- **Finetuned from model:** google/gemma-2b |
|
|
- **Recommended inference device:** NVIDIA GeForce RTX 3080 Ti (12 GB VRAM)
|
|
|
|
|
 |
|
|
|
|
|
## How to Get Started with the Model |
|
|
|
|
|
1. Request access to the google/gemma-2b model
|
|
You can submit an access request on the [google/gemma-2b](https://huggingface.co/google/gemma-2b) model page.
|
|
|
|
|
2. Log in
|
|
If you are using the CLI, run the command below and log in to your Hugging Face account with your access token:
|
|
```bash |
|
|
huggingface-cli login |
|
|
``` |
|
|
|
|
|
OR |
|
|
|
|
|
If you are using a notebook, run the cell below and log in to your Hugging Face account with your access token:
|
|
```python |
|
|
from huggingface_hub import login |
|
|
login() |
|
|
``` |
|
|
|
|
|
3. Install the required libraries
|
|
|
|
|
```bash
pip install peft==0.12.0 transformers==4.44.2
```

The inference example below also imports `torch`; install it separately if it is not already present.
|
|
|
|
|
|
|
|
4. Load the `Lei-bw/text-to-sql-fm` model
|
|
```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the base model, then attach the LoRA adapter weights on top of it
base_model = AutoModelForCausalLM.from_pretrained("google/gemma-2b")
model = PeftModel.from_pretrained(base_model, "Lei-bw/text-to-sql-fm")
tokenizer = AutoTokenizer.from_pretrained("Lei-bw/text-to-sql-fm")
```
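Since the Model Description notes the base model was quantized to 8-bit during training, you can optionally load the base in 8-bit for inference as well. This is a minimal sketch assuming `bitsandbytes` is installed; it is not required to use the adapter:

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Optional: load the base model with 8-bit weights (requires bitsandbytes)
bnb_config = BitsAndBytesConfig(load_in_8bit=True)
base_model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2b",
    quantization_config=bnb_config,
    device_map="auto",
)
model = PeftModel.from_pretrained(base_model, "Lei-bw/text-to-sql-fm")
tokenizer = AutoTokenizer.from_pretrained("Lei-bw/text-to-sql-fm")
```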
|
|
|
|
|
 |
|
|
|
|
|
|
|
|
5. Example usage:

```python
import torch

# Example question to translate into SQL
eval_prompt = """
How many seniors in the departments are younger than 28?
"""

model_input = tokenizer(eval_prompt, return_tensors="pt")

model.eval()
with torch.no_grad():
    output = model.generate(**model_input, max_new_tokens=300)
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```
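Because the training data pairs each question with the schema of its tables, prompts that include the relevant `CREATE TABLE` statement will likely produce better SQL. The schema below is hypothetical, purely for illustration:

```python
# Hypothetical schema prepended to the question, mirroring the
# question/context pairs in b-mc2/sql-create-context
eval_prompt = """
CREATE TABLE department (name VARCHAR, age INTEGER, rank VARCHAR)

How many seniors in the departments are younger than 28?
"""
```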
|
|
|
|
|
## Training Details |
|
|
|
|
|
### Training Data |
|
|
|
|
|
The model was fine-tuned on the [b-mc2/sql-create-context](https://huggingface.co/datasets/b-mc2/sql-create-context) dataset, which pairs natural-language questions and table schemas (CREATE TABLE statements) with their corresponding SQL queries.
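To inspect the data, you can load it with the `datasets` library; a minimal sketch:

```python
from datasets import load_dataset

# Each record contains a natural-language question, a CREATE TABLE
# context, and the target SQL query
dataset = load_dataset("b-mc2/sql-create-context")
print(dataset["train"][0])
```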
|
|
|
|
|
|
|
|
#### Training Hyperparameters |
|
|
|
|
|
- **Training regime:** bf16 mixed precision
- **Batch size:** 16
- **Gradient accumulation steps:** 4
- **Warmup steps:** 50
- **Number of epochs:** 2
- **Learning rate:** 2e-4
- **Weight decay:** 0.01
- **Optimizer:** AdamW
- **Learning rate scheduler:** Linear
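As a rough sketch, the hyperparameters above map onto `transformers` `TrainingArguments` as follows; `output_dir` is a placeholder, and the exact training script is not published here:

```python
from transformers import TrainingArguments

# Sketch of the hyperparameters listed above; output_dir is a placeholder
training_args = TrainingArguments(
    output_dir="text-to-sql-fm",
    per_device_train_batch_size=16,
    gradient_accumulation_steps=4,
    warmup_steps=50,
    num_train_epochs=2,
    learning_rate=2e-4,
    weight_decay=0.01,
    optim="adamw_torch",
    lr_scheduler_type="linear",
    bf16=True,
)
```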
|
|
|
|
|
#### Hardware |
|
|
|
|
|
- **Hardware Type:** NVIDIA A100 |
|
|
- **GPU RAM:** 40 GB |
|
|
|
|
|
|
|
|
### Framework versions |
|
|
|
|
|
- PEFT 0.12.0 |