LangToSQL / README.md
Komma-LuisMiSanVe's picture
Update files
c4225a4
metadata
language:
  - en
thumbnail: >-
  https://raw.githubusercontent.com/LuisMiSanVe/LuisMiSanVe/refs/heads/main/Resources/LangToSQL/LangToSQLLLM_banner.png
tags:
  - sql
  - text-to-sql
  - llm
  - fine-tuning
  - gguf
license: apache-2.0
datasets:
  - xlangai/spider
base_model: Qwen/Qwen2.5-Coder-1.5B-Instruct

See in spanish/Ver en espaΓ±ol

LangToSQL LLM Banner

πŸ€– AI Model for PostgreSQL queries

image image image Python PyTorch NumPy HuggingFace

Check out other versions of this program:

The AI model has been trained for turning natural language to PostgreSQL queries.

πŸ“ Technology Explanation

This model uses Gwen Coder as a base and then is fine tuned with Spider datasets.

The JSON dataset file contains Spider's train_spider.json as is the main dataset.

The model is exported to GGUF with llama.cpp so it can be used by programs like LM Studio.

πŸ› οΈ Setup

In order to execute the training script for your own, you first need to install Python and run this command:

pip install transformers datasets peft accelerate bitsandbytes trl==1.0.0

Depending on the version, you may have to use this instead:

py -m pip install transformers datasets peft accelerate bitsandbytes trl==1.0.0

Make sure the TRL library version is 1.0.0, as is the only version supported by the trainer script.

πŸ“‚ Files

This repository includes the trained LLM model's files, its training script, the training dataset and a tester script to test the .safetensors model.

You can download the final GGUF in the Releases.

πŸš€ Releases

The version will be released using these versioning policies:
New major features and critical bug fixes will cause the immediate release of a new version, while other minor changes or fixes will wait one week since the time the change is introduced in the repository before being included in the new version, so that other potential changes can be added.

These potencial new changes will not increase the wait time for the new version beyond one week.

The version number will follow this format:
[Major Feature].[Minor Feature].[Bug Fixes]

πŸ’» Technologies Used