Spaces:

Wizz13150
/

WizzGPT

Running

Wizz13150 commited on Jun 30, 2025

Commit

c904a84

1 Parent(s): b1548a4

Create app.py

Files changed (1) hide show

app.py ADDED Viewed

+import gradio as gr
+from llama_cpp import Llama
+# Charger le modèle GGUF (Q8_0)
+llm = Llama(
+    model_path="model/gptq_model.gguf",
+    n_threads=1,
+    n_ctx=2048,
+    n_batch=64,
+    use_mlock=True
+)
+def generate(prompt):
+    output = llm(prompt, max_tokens=200, stop=["</s>"], echo=False)
+    return output["choices"][0]["text"]
+# UI Gradio
+iface = gr.Interface(fn=generate,
+                     inputs=gr.Textbox(lines=5, label="Prompt"),
+                     outputs=gr.Textbox(label="Réponse"),
+                     title="WizzGPT - CPU Demo")
+iface.launch()