Update README.md
README.md
CHANGED
|
@@ -23,18 +23,31 @@ metrics:
|
|
| 23 |
</div>
|
| 24 |
|
| 25 |
|
| 26 |
-
<div align="center">
|
| 27 |
-
|
| 28 |
-
|
| 29 |
-
|
| 30 |
-
|
| 31 |
-
|
| 32 |
-
|
| 33 |
-
|
| 34 |
-
|
| 35 |
-
|
| 36 |
</div>
|
| 37 |
|
|
|
|
| 38 |
## Abstract
|
| 39 |
Recent advances in large language models (LLMs) have made it possible to automate even complex NLP tasks such as code generation. This paper describes deepSQL-R1-distill-8B, a fine-tuned and quantized variant of the DeepSeek-R1 model architecture that is optimized specifically for text-to-SQL conversion. Fine-tuning was performed with Unsloth, an efficient framework for fine-tuning LLMs, in combination with Parameter-Efficient Fine-Tuning (PEFT) and the SFTTrainer framework, which allows domain-specific adaptation with low resource consumption. The model is fine-tuned on curated datasets using LoRA, which keeps the number of trainable parameters and the memory footprint small. We also investigate reinforcement learning techniques to further improve the model's ability to generate accurate and contextually appropriate SQL queries. The combination of 8-bit quantization, LoRA, Unsloth, and reinforcement learning positions deepSQL-R1-distill-8B as a practical solution for automatic SQL generation in real-world applications. By addressing computational efficiency, domain-specific adaptation, and reinforcement-based refinement, the model aims to make interacting with relational databases more intuitive and resource-efficient.
|
| 40 |
|
|
@@ -194,43 +207,50 @@ _ = model.generate(**inputs, streamer=text_streamer, max_new_tokens=350)
|
|
| 194 |
```
|
| 195 |
|
| 196 |
## Author
|
| 197 |
-
<div align="center">
|
| 198 |
-
<
|
| 199 |
-
<
|
| 200 |
-
|
| 201 |
-
|
| 202 |
-
<
|
| 203 |
-
|
| 204 |
-
|
| 205 |
-
<
|
| 206 |
-
|
| 207 |
-
|
| 208 |
-
<
|
| 209 |
-
|
| 210 |
-
|
| 211 |
-
<
|
| 212 |
-
|
| 213 |
-
|
| 214 |
-
<
|
| 215 |
-
|
| 216 |
-
|
| 217 |
-
<
|
| 218 |
-
|
| 219 |
-
|
| 220 |
-
<
|
| 221 |
-
|
| 222 |
-
|
| 223 |
-
<
|
| 224 |
-
|
| 225 |
-
</a>
|
| 226 |
-
</p>
|
| 227 |
</div>
|
| 228 |
|
| 229 |
|
|
|
|
| 230 |
## Services Used
|
| 231 |
-
<
|
| 232 |
-
|
| 233 |
-
|
| 234 |
-
|
| 235 |
-
|
| 236 |
-
|
| 23 |
</div>
|
| 24 |
|
| 25 |
|
| 26 |
+
<div align="center" style="line-height: 1;">
|
| 27 |
+
<a href="https://huggingface.co/imsanjoykb/deepSQL-R1-distill-8B" target="_blank" style="margin: 2px;">
|
| 28 |
+
<img alt="Hugging Face Model" src="https://img.shields.io/badge/HuggingFace-Model-FF6F00?style=for-the-badge&logo=huggingface&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 29 |
+
</a>
|
| 30 |
+
<a href="https://drive.google.com/file/d/145PP-oW50OMS1bYJaYuUphfufpsuOGWl/view?usp=sharing" target="_blank" style="margin: 2px;">
|
| 31 |
+
<img alt="Open In Colab" src="https://img.shields.io/badge/Open%20in%20Colab-FF6F00%2F000000?style=for-the-badge&logo=googlecolab&logoColor=white&labelColor=FF6F00" style="display: inline-block; vertical-align: middle;">
|
| 32 |
+
</a>
|
| 33 |
+
<a href="https://www.kaggle.com/code/imsanjoykb/inference-deepsql-r1-distill-8b" target="_blank" style="margin: 2px;">
|
| 34 |
+
<img alt="Kaggle Notebook" src="https://img.shields.io/badge/Kaggle-Notebook-20BEFF?style=for-the-badge&logo=kaggle&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 35 |
+
</a>
|
| 36 |
+
<a href="https://github.com/imsanjoykb/deepSQL-R1-distill-8B" target="_blank" style="margin: 2px;">
|
| 37 |
+
<img alt="GitHub Repo" src="https://img.shields.io/badge/GitHub-Repo-181717?style=for-the-badge&logo=github&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 38 |
+
</a>
|
| 39 |
+
<a href="https://huggingface.co/spaces/imsanjoykb/deepSQL-R1-distill-8B" target="_blank" style="margin: 2px;">
|
| 40 |
+
<img alt="Gradio App" src="https://img.shields.io/badge/Chat%20App-Gradio-0084FF?style=for-the-badge&logo=gradio&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 41 |
+
</a>
|
| 42 |
+
<a href="https://colab.research.google.com/drive/1ze7qAQnjppZKfxNVBXXlOBTM6xFWEYrJ?usp=sharing" target="_blank" style="margin: 2px;">
|
| 43 |
+
<img alt="Gradio-Colab" src="https://img.shields.io/badge/Gradio-Colab-0084FF?style=for-the-badge&logo=gradio&labelColor=F9AB00" style="display: inline-block; vertical-align: middle;">
|
| 44 |
+
</a>
|
| 45 |
+
<a href="https://arxiv.org/abs/Your_Paper_ID" target="_blank" style="margin: 2px;">
|
| 46 |
+
<img alt="arXiv Paper" src="https://img.shields.io/badge/arXiv-Preprint-B31B1B?style=for-the-badge&logo=arxiv&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 47 |
+
</a>
|
| 48 |
</div>
|
| 49 |
|
| 50 |
+
|
| 51 |
## Abstract
|
| 52 |
Recent advances in large language models (LLMs) have made it possible to automate even complex NLP tasks such as code generation. This paper describes deepSQL-R1-distill-8B, a fine-tuned and quantized variant of the DeepSeek-R1 model architecture that is optimized specifically for text-to-SQL conversion. Fine-tuning was performed with Unsloth, an efficient framework for fine-tuning LLMs, in combination with Parameter-Efficient Fine-Tuning (PEFT) and the SFTTrainer framework, which allows domain-specific adaptation with low resource consumption. The model is fine-tuned on curated datasets using LoRA, which keeps the number of trainable parameters and the memory footprint small. We also investigate reinforcement learning techniques to further improve the model's ability to generate accurate and contextually appropriate SQL queries. The combination of 8-bit quantization, LoRA, Unsloth, and reinforcement learning positions deepSQL-R1-distill-8B as a practical solution for automatic SQL generation in real-world applications. By addressing computational efficiency, domain-specific adaptation, and reinforcement-based refinement, the model aims to make interacting with relational databases more intuitive and resource-efficient.
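
To make the parameter-efficiency claim for LoRA concrete, the following is a minimal arithmetic sketch, not the training code used for this model. It compares the number of weights in one dense projection with the number of trainable weights in the two low-rank LoRA factors. The layer size (4096 x 4096) and the rank (16) are illustrative assumptions; the README does not state the values actually used.

```python
def full_params(d_in: int, d_out: int) -> int:
    """Number of weights in a dense d_out x d_in projection matrix."""
    return d_in * d_out


def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable weights in the LoRA factors A (rank x d_in) and B (d_out x rank)."""
    return rank * d_in + d_out * rank


# Assumed, illustrative values (not taken from the model card).
D_IN = 4096
D_OUT = 4096
RANK = 16

full = full_params(D_IN, D_OUT)
lora = lora_trainable_params(D_IN, D_OUT, RANK)
ratio = lora / full

print(f"dense weights:     {full:,}")
print(f"LoRA trainable:    {lora:,}")
print(f"trainable share:   {ratio:.4%}")
```

Under these assumed sizes, the LoRA factors hold 1/128 of the weights of the dense projection, which is why only a small fraction of the model needs gradients and optimizer state during fine-tuning.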
|
| 53 |
|
|
|
|
| 207 |
```
|
| 208 |
|
| 209 |
## Author
|
| 210 |
+
<div align="center" style="line-height: 1;">
|
| 211 |
+
<a href="mailto:imsanjoykb@gmail.com" target="_blank" style="margin: 2px;">
|
| 212 |
+
<img alt="Email" src="https://img.shields.io/badge/Gmail-D14836?style=for-the-badge&logo=gmail&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 213 |
+
</a>
|
| 214 |
+
<a href="https://imsanjoykb.github.io/" target="_blank" style="margin: 2px;">
|
| 215 |
+
<img alt="Portfolio" src="https://img.shields.io/badge/Portfolio-8B89CC?style=for-the-badge&logo=protonmail&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 216 |
+
</a>
|
| 217 |
+
<a href="https://www.linkedin.com/in/imsanjoykb/" target="_blank" style="margin: 2px;">
|
| 218 |
+
<img alt="Linkedin" src="https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge&logo=linkedin&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 219 |
+
</a>
|
| 220 |
+
<a href="https://orcid.org/0009-0001-6265-841X" target="_blank" style="margin: 2px;">
|
| 221 |
+
<img alt="ORCID" src="https://img.shields.io/badge/ORCID-0000--002--182-green?style=for-the-badge&logo=orcid&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 222 |
+
</a>
|
| 223 |
+
<a href="https://github.com/imsanjoykb/" target="_blank" style="margin: 2px;">
|
| 224 |
+
<img alt="Github" src="https://img.shields.io/badge/GitHub-100000?style=for-the-badge&logo=github&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 225 |
+
</a>
|
| 226 |
+
<a href="https://medium.com/@imsanjoykb" target="_blank" style="margin: 2px;">
|
| 227 |
+
<img alt="Medium" src="https://img.shields.io/badge/Medium-000000?style=for-the-badge&logo=medium&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 228 |
+
</a>
|
| 229 |
+
<a href="https://www.kaggle.com/imsanjoykb" target="_blank" style="margin: 2px;">
|
| 230 |
+
<img alt="Kaggle" src="https://img.shields.io/badge/Kaggle-20BEFF?style=for-the-badge&logo=Kaggle&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 231 |
+
</a>
|
| 232 |
+
<a href="https://instagram.com/imsanjoykb/" target="_blank" style="margin: 2px;">
|
| 233 |
+
<img alt="Instagram" src="https://img.shields.io/badge/Instagram-E4405F?style=for-the-badge&logo=instagram&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 234 |
+
</a>
|
| 235 |
+
<a href="https://discord.com/channels/@imsanjoykb" target="_blank" style="margin: 2px;">
|
| 236 |
+
<img alt="Discord" src="https://img.shields.io/badge/Discord-7289DA?style=for-the-badge&logo=discord&logoColor=white" style="display: inline-block; vertical-align: middle;">
|
| 237 |
+
</a>
|
| 238 |
</div>
|
| 239 |
|
| 240 |
|
| 241 |
+
|
| 242 |
## Services Used
|
| 243 |
+
<div align="center" style="line-height: 1;">
|
| 244 |
+
<a href="#" target="_blank" style="margin: 2px;">
|
| 245 |
+
<img src="https://ia801209.us.archive.org/26/items/github.com-unslothai-unsloth_-_2023-12-03_15-21-29/cover.jpg" alt="Unsloth" width="100" style="display: inline-block; vertical-align: middle;">
|
| 246 |
+
</a>
|
| 247 |
+
<a href="#" target="_blank" style="margin: 2px;">
|
| 248 |
+
<img src="https://wandb.ai/logo.png" alt="Weights & Biases" width="100" style="display: inline-block; vertical-align: middle;">
|
| 249 |
+
</a>
|
| 250 |
+
<a href="#" target="_blank" style="margin: 2px;">
|
| 251 |
+
<img src="https://huggingface.co/front/assets/huggingface_logo.svg" alt="Hugging Face" width="100" style="display: inline-block; vertical-align: middle;">
|
| 252 |
+
</a>
|
| 253 |
+
<a href="#" target="_blank" style="margin: 2px;">
|
| 254 |
+
<img src="https://images.saasworthy.com/tr:w-160,h-0,c-at_max,e-sharpen-1/gradio_43063_logo_1681283997_9ue7l.jpg" alt="Gradio" width="100" style="display: inline-block; vertical-align: middle;">
|
| 255 |
+
</a>
|
| 256 |
+
</div>
|