Spaces: VcRlAgent/TestLLMGen

VcRlAgent committed on Nov 16, 2025

Commit

07e4e32

1 Parent(s): 956bbf8

Starter LLM Inference Call

Files changed (2)

app.py ADDED

+import os
+import gradio as gr
+from openai import OpenAI
+# Initialize HF Router client using OpenAI SDK
+client = OpenAI(
+    base_url="https://router.huggingface.co/v1",
+    api_key=os.environ["HF_TOKEN"],   # ensure HF_TOKEN is set
+)
+# LLM function
+def ask_llm(prompt):
+    try:
+        completion = client.chat.completions.create(
+            model="meta-llama/Llama-3.1-8B-Instruct",
+            messages=[
+                {"role": "user", "content": prompt}
+            ],
+            max_tokens=200
+        )
+        # openai>=1.0 returns a message object, so read .content as an attribute
+        return completion.choices[0].message.content
+    except Exception as e:
+        return f"Error: {str(e)}"
+# Build Gradio UI
+demo = gr.Interface(
+    fn=ask_llm,
+    inputs=gr.Textbox(lines=3, label="Ask the AI"),
+    outputs=gr.Textbox(label="Response"),
+    title="HF Router LLM Demo",
+    description="Powered by HuggingFace Router + OpenAI SDK client."
+)
+demo.launch()
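
The request/response handling in `ask_llm` can be exercised without network access or a token. The following is a minimal sketch, not part of this commit: `read_token`, `extract_text`, `_StubCompletions`, and `stub_client` are hypothetical names introduced for illustration, and `ask_llm` is given an extra `client` parameter (an assumption) so that a stub can stand in for the real `OpenAI` client.

```python
import os
from types import SimpleNamespace


def read_token(env=os.environ, name="HF_TOKEN"):
    """Hypothetical helper: return the token or fail with a readable message."""
    token = env.get(name)
    if not token:
        raise RuntimeError(f"{name} is not set; add it as a Space secret")
    return token


def extract_text(completion):
    """Hypothetical helper: text of the first choice, or "" if content is None."""
    # The message is an object, so content is read as an attribute.
    return completion.choices[0].message.content or ""


def ask_llm(prompt, client, model="meta-llama/Llama-3.1-8B-Instruct"):
    """Same call shape as app.py, with the client passed in (assumption)."""
    try:
        completion = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}],
            max_tokens=200,
        )
        return extract_text(completion)
    except Exception as e:
        return f"Error: {e}"


class _StubCompletions:
    """Stand-in for client.chat.completions that echoes the user prompt."""

    def create(self, **kwargs):
        text = "echo: " + kwargs["messages"][0]["content"]
        message = SimpleNamespace(content=text)
        return SimpleNamespace(choices=[SimpleNamespace(message=message)])


# Mimics the attribute path client.chat.completions.create(...)
stub_client = SimpleNamespace(chat=SimpleNamespace(completions=_StubCompletions()))
```

Calling `ask_llm("hello", stub_client)` goes through the same code path as the real client would, which makes it easy to check the response parsing in isolation before wiring the function into `gr.Interface`.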

requirements.txt ADDED

@@ -0,0 +1 @@
+openai>=1.51.0