NanoBotAIAgent commited on
Commit
f1242e0
Β·
verified Β·
1 Parent(s): 1322b24

Update title to Q8, update README

Browse files
Files changed (1) hide show
  1. README.md +46 -5
README.md CHANGED
@@ -1,11 +1,52 @@
1
  ---
2
- title: Gemma-4-E4B Uncensored API
3
- emoji: 🌍
4
- colorFrom: gray
5
- colorTo: blue
6
  sdk: docker
7
  app_port: 8000
8
  pinned: false
9
  ---
10
 
11
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: Gemma-4-E4B Uncensored Q8 API
3
+ emoji: πŸ”“
4
+ colorFrom: pink
5
+ colorTo: pink
6
  sdk: docker
7
  app_port: 8000
8
  pinned: false
9
  ---
10
 
11
+ OpenAI-compatible API for [HauhauCS/Gemma-4-E4B-Uncensored-HauhauCS-Aggressive](https://huggingface.co/HauhauCS/Gemma-4-E4B-Uncensored-HauhauCS-Aggressive)
12
+
13
+ ## Model Details
14
+
15
+ | Spec | Value |
16
+ |------|-------|
17
+ | Model | Gemma-4-E4B |
18
+ | Quantization | Q8_K_P (high quality) |
19
+ | Context | 131072 tokens |
20
+ | Concurrent | 1 request |
21
+ | Reasoning | Enabled by default (`--jinja --reasoning-format deepseek`) |
22
+
23
+ ## Endpoints
24
+
25
+ - `POST /v1/chat/completions` β€” Chat completions (streaming recommended)
26
+ - `POST /v1/completions` β€” Text completions
27
+ - `GET /v1/models` β€” List models
28
+ - `GET /health` β€” Health check
29
+ - `GET /api-info` β€” JSON status
30
+
31
+ ## Usage
32
+
33
+ ```python
34
+ import openai
35
+
36
+ client = openai.OpenAI(
37
+ base_url="https://nanobotaiagent-gemma4-uncensored-api.hf.space/v1",
38
+ api_key="no-key",
39
+ timeout=600.0,
40
+ )
41
+
42
+ response = client.chat.completions.create(
43
+ model="gemma",
44
+ messages=[{"role": "user", "content": "Hello!"}],
45
+ max_tokens=2048,
46
+ stream=True,
47
+ )
48
+ for chunk in response:
49
+ delta = chunk.choices[0].delta
50
+ if delta.content:
51
+ print(delta.content, end="")
52
+ ```