Spaces:
Sleeping
Sleeping
metadata
title: ZeroEngine V0.1
emoji: 🚀
colorFrom: blue
colorTo: gray
sdk: gradio
sdk_version: 6.5.0
app_file: app.py
pinned: false
license: apache-2.0
ZeroEngine System Kernel
A specialized inference engine optimized for low-resource Hugging Face Spaces (2 vCPUs / 16GB RAM).
Key Features
- Deterministic Partitioning: Strictly splits 2 vCPUs between two concurrent users.
- Resource Gatekeeper: Prevents OOM crashes with a strict 50% RAM model limit and 200MB system buffer.
- Ghosting Queue: Enables pre-typing and background prompt preparation for queued users.
- Persistence Layer: Tracks model popularity by pushing telemetry JSONs to the HF Hub via
HF_TOKEN.
Hardware Specifications
- CPU: 2 vCPUs (shared)
- RAM: 16 GB (Shared)
- Optimization:
llama-cppwith mmap and single-core pinning per slot.