ZhafranR committed · Commit ef741c2 · verified · Parent(s): 50666c8

Update README.md

Files changed (1): README.md +25 −0
This model was converted to GGUF format from [`XeAI/LLaMa_3.2_3B_Instruct_Text2SQL`](https://huggingface.co/XeAI/LLaMa_3.2_3B_Instruct_Text2SQL) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.

Refer to the [original model card](https://huggingface.co/XeAI/LLaMa_3.2_3B_Instruct_Text2SQL) for more details on the model.
## Use with llama-cpp-python

```python
from llama_cpp import Llama

# Load the model
model = Llama(
    model_path="path_to_your_model.gguf",
    n_ctx=2048,    # context window size
    n_batch=512,   # batch size for prompt processing
    n_threads=6    # number of CPU threads
)

# Generate text
output = model.create_completion(
    "Your prompt here",
    max_tokens=512,
    temperature=0.7,
    top_p=0.95,
    top_k=40,
    repeat_penalty=1.1
)
print(output['choices'][0]['text'])
```
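Since this is a Text2SQL model, plain-string prompts passed to `create_completion` should follow the Llama 3.2 instruct chat template. A minimal sketch of building such a prompt (the schema, question, and system message below are illustrative placeholders; the special tokens assume the standard Llama 3.x template):

```python
# Hypothetical example: formatting a text-to-SQL prompt in the Llama 3.2
# instruct template before passing it to model.create_completion().
schema = "CREATE TABLE users (id INT, name TEXT, signup_date DATE);"
question = "How many users signed up in 2024?"

prompt = (
    "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n"
    "You are a text-to-SQL assistant. Given a database schema, "
    "answer the question with a single SQL query.<|eot_id|>"
    "<|start_header_id|>user<|end_header_id|>\n\n"
    f"Schema:\n{schema}\n\nQuestion: {question}<|eot_id|>"
    "<|start_header_id|>assistant<|end_header_id|>\n\n"
)
```

Alternatively, `model.create_chat_completion(messages=[...])` applies the chat template stored in the GGUF metadata automatically, so you can pass plain role/content messages instead of formatting the tokens yourself.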
## Use with llama.cpp

Install llama.cpp through brew (works on Mac and Linux)