Space: NanoBotAIAgent/gemma4-uncensored-api

NanoBotAIAgent committed 6 days ago

Commit f1242e0 (verified) · 1 parent: 1322b24

Update title to Q8, update README

Files changed (1)

README.md

@@ -1,11 +1,52 @@
 ---
-title: Gemma-4-E4B Uncensored API
-emoji: 🌍
-colorFrom: gray
-colorTo: blue
 sdk: docker
 app_port: 8000
 pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
+title: Gemma-4-E4B Uncensored Q8 API
+emoji: 🔓
+colorFrom: pink
+colorTo: pink
 sdk: docker
 app_port: 8000
 pinned: false
 ---
+OpenAI-compatible API for [HauhauCS/Gemma-4-E4B-Uncensored-HauhauCS-Aggressive](https://huggingface.co/HauhauCS/Gemma-4-E4B-Uncensored-HauhauCS-Aggressive)
+## Model Details
+| Spec | Value |
+|------|-------|
+| Model | Gemma-4-E4B |
+| Quantization | Q8_K_P (high quality) |
+| Context | 131072 tokens |
+| Concurrent | 1 request |
+| Reasoning | Enabled by default (`--jinja --reasoning-format deepseek`) |
+## Endpoints
+- `POST /v1/chat/completions` — Chat completions (streaming recommended)
+- `POST /v1/completions` — Text completions
+- `GET /v1/models` — List models
+- `GET /health` — Health check
+- `GET /api-info` — JSON status
+## Usage
+```python
+import openai
+client = openai.OpenAI(
+    base_url="https://nanobotaiagent-gemma4-uncensored-api.hf.space/v1",
+    api_key="no-key",
+    timeout=600.0,
+)
+response = client.chat.completions.create(
+    model="gemma",
+    messages=[{"role": "user", "content": "Hello!"}],
+    max_tokens=2048,
+    stream=True,
+)
+for chunk in response:
+    delta = chunk.choices[0].delta
+    if delta.content:
+        print(delta.content, end="")
+```
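
The README lists `GET /health` and `GET /v1/models` but does not show how to call them. The sketch below polls the health endpoint before listing models, using only the standard library. It assumes that `/health` answers with HTTP 200 once the server is ready and that the Space may need some time to start. The helper names (`endpoint_url`, `wait_until_healthy`, `http_status`, `main`) and the retry settings are illustrative and not part of the Space.

```python
import json
import time
import urllib.error
import urllib.request

# Host taken from the README's base_url; the paths are the ones the README lists.
BASE_URL = "https://nanobotaiagent-gemma4-uncensored-api.hf.space"


def endpoint_url(path: str, base_url: str = BASE_URL) -> str:
    """Join the base URL and an endpoint path with exactly one slash."""
    return base_url.rstrip("/") + "/" + path.lstrip("/")


def http_status(url: str, timeout: float = 30.0):
    """Return the HTTP status code of a GET request, or None if it fails."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        return err.code  # the server answered, but with an error status
    except (urllib.error.URLError, OSError):
        return None  # no answer at all (DNS failure, timeout, refused)


def wait_until_healthy(fetch_status, attempts: int = 30, delay: float = 10.0) -> bool:
    """Call fetch_status() until it returns 200 (assumed to mean ready)."""
    for attempt in range(attempts):
        if fetch_status() == 200:
            return True
        if attempt < attempts - 1:
            time.sleep(delay)
    return False


def main() -> None:
    """Wait for the server, then print the model list as JSON."""
    if not wait_until_healthy(lambda: http_status(endpoint_url("/health"))):
        print("Server did not report healthy in time")
        return
    with urllib.request.urlopen(endpoint_url("/v1/models"), timeout=30.0) as resp:
        print(json.dumps(json.load(resp), indent=2))
```

Calling `main()` performs the network requests. The status fetcher is passed in as a callable so the retry logic can be exercised without a live server.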
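
The README states that reasoning is enabled with `--reasoning-format deepseek`, while its usage example prints only `delta.content`. The sketch below separates reasoning text from answer text while streaming. It assumes the server places reasoning in a `reasoning_content` field on each delta, which is how llama.cpp's server behaves with that flag; the README does not confirm the field name, so it is read with `getattr` and treated as optional. `split_stream` is a hypothetical helper, not part of the Space or the `openai` package.

```python
def split_stream(chunks):
    """Collect (reasoning, answer) text from an iterable of streamed chunks.

    Each chunk is expected to look like the objects yielded by
    client.chat.completions.create(..., stream=True).
    """
    reasoning, answer = [], []
    for chunk in chunks:
        if not chunk.choices:
            continue  # some servers send chunks without choices (e.g. usage only)
        delta = chunk.choices[0].delta
        # Assumed field name; absent on servers that do not emit reasoning.
        thought = getattr(delta, "reasoning_content", None)
        if thought:
            reasoning.append(thought)
        text = getattr(delta, "content", None)
        if text:
            answer.append(text)
    return "".join(reasoning), "".join(answer)
```

With the client from the README, `reasoning, answer = split_stream(response)` would consume the stream and return both parts. Note that `max_tokens` presumably counts reasoning tokens as well, so a long reasoning phase can leave less room for the answer.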