Spaces:
Sleeping
Sleeping
metadata
title: ZeroEngine V0.1
emoji: 🚀
colorFrom: gray
colorTo: gray
sdk: gradio
sdk_version: 6.5.0
app_file: app.py
pinned: false
license: apache-2.0
python_version: 3.11
ZeroEngine V0.1 (Kernel)
High-performance inference engine for 2-vCPU / 16GB RAM constraints.
Optimizations
- KV-Cache Stitching: Asynchronous pre-evaluation of queue inputs.
- Hard Partitioning: Dedicated core assignment per concurrent user.
- Memory Mapping: weights mapped via
mmapto preserve RAM for context.