Update README.md
README.md
CHANGED
@@ -1,14 +1,25 @@
 ---
-title: ZeroEngine
-emoji:
-colorFrom:
-colorTo:
+title: ZeroEngine V0.1
+emoji: 🚀
+colorFrom: blue
+colorTo: gray
 sdk: gradio
 sdk_version: 6.5.0
 app_file: app.py
 pinned: false
 license: apache-2.0
-short_description: 2 vCPUs, dynamic model running.
 ---
 
-
+# ZeroEngine System Kernel
+A specialized inference engine optimized for low-resource Hugging Face Spaces (2 vCPUs / 16GB RAM).
+
+## Key Features
+- **Deterministic Partitioning**: Strictly splits 2 vCPUs between two concurrent users.
+- **Resource Gatekeeper**: Prevents OOM crashes with a strict 50% RAM model limit and 200MB system buffer.
+- **Ghosting Queue**: Enables pre-typing and background prompt preparation for queued users.
+- **Persistence Layer**: Tracks model popularity by pushing telemetry JSONs to the HF Hub via `HF_TOKEN`.
+
+## Hardware Specifications
+- **CPU**: 2 vCPUs (shared)
+- **RAM**: 16 GB (Shared)
+- **Optimization**: `llama-cpp` with mmap and single-core pinning per slot.
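The "Resource Gatekeeper" bullet in the added README gives two numbers, a 50% RAM model limit and a 200MB system buffer, but the diff does not show how they are applied. One plausible reading, sketched below as an assumption rather than as the code in `app.py`, is that a model is admitted only if it is no larger than half of total RAM and still leaves the buffer free once loaded. The names `check_model_fits` and `GateDecision` are hypothetical, and treating "200MB" as 200 MiB is also an assumption.

```python
from dataclasses import dataclass

MIB = 1024 * 1024
GIB = 1024 * MIB


@dataclass(frozen=True)
class GateDecision:
    """Outcome of the admission check (hypothetical type, not from the repo)."""
    allowed: bool
    reason: str


def check_model_fits(
    model_bytes: int,
    total_ram_bytes: int,
    available_ram_bytes: int,
    max_model_fraction: float = 0.5,       # "strict 50% RAM model limit"
    system_buffer_bytes: int = 200 * MIB,  # "200MB system buffer" (assumed MiB)
) -> GateDecision:
    """Decide whether a model of the given size may be loaded.

    Assumed rule: the model must not exceed the fraction cap of total RAM,
    and loading it must leave at least the system buffer available.
    """
    model_cap = int(total_ram_bytes * max_model_fraction)
    if model_bytes > model_cap:
        return GateDecision(False, "model exceeds the RAM fraction cap")
    if model_bytes + system_buffer_bytes > available_ram_bytes:
        return GateDecision(False, "loading would eat into the system buffer")
    return GateDecision(True, "ok")


if __name__ == "__main__":
    # A 4 GiB model on the 16 GiB Space with 10 GiB currently free.
    print(check_model_fits(4 * GIB, 16 * GIB, 10 * GIB))
```

The sizes are passed in as plain integers so the rule can be checked without touching the host; in a real Space the model size might come from `os.path.getsize` on the model file and the memory figures from a system probe.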
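The README also mentions splitting 2 vCPUs between two concurrent users and "single-core pinning per slot". The following is a minimal sketch of what that could look like, assuming a Linux host where `os.sched_setaffinity` is available; the helper names `plan_slots`, `visible_cores`, and `pin_caller` are hypothetical and not taken from the repository.

```python
import os


def visible_cores() -> set[int]:
    """Return the CPU ids this process may run on (falls back to a count)."""
    if hasattr(os, "sched_getaffinity"):
        return set(os.sched_getaffinity(0))
    return set(range(os.cpu_count() or 1))


def plan_slots(cores: set[int], n_slots: int = 2) -> dict[int, int]:
    """Map each user slot to exactly one core, deterministically.

    Cores are sorted so the same inputs always yield the same assignment.
    """
    ordered = sorted(cores)
    if len(ordered) < n_slots:
        raise ValueError(f"need {n_slots} cores, only {len(ordered)} visible")
    return {slot: ordered[slot] for slot in range(n_slots)}


def pin_caller(core: int) -> bool:
    """Restrict the caller to one core; return False where unsupported."""
    if not hasattr(os, "sched_setaffinity"):
        return False
    os.sched_setaffinity(0, {core})
    return True


if __name__ == "__main__":
    plan = plan_slots(visible_cores())
    print(plan)
    # A worker serving slot 0 would then call, for example:
    # pin_caller(plan[0])
```

If the engine loads models through llama-cpp-python, a pinned worker would presumably also limit the model to one thread (for example `n_threads=1`) and keep `use_mmap=True`, but the diff does not show the actual loader call, so that remains an assumption.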