title: ZeroEngine V0.1
emoji: 🚀
colorFrom: gray
colorTo: gray
sdk: gradio
sdk_version: 6.5.0
app_file: app.py
pinned: false
license: apache-2.0
python_version: 3.11

ZeroEngine V0.1 (Kernel)

High-performance inference engine designed to run within a 2-vCPU / 16 GB RAM budget.

Optimizations

  • KV-Cache Stitching: Asynchronous pre-evaluation of queued inputs.
  • Hard Partitioning: A dedicated core is assigned to each concurrent user.
  • Memory Mapping: Weights are mapped via mmap to preserve RAM for context.
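
As a rough illustration of the Hard Partitioning and Memory Mapping items above, the sketch below uses only the Python standard library. It is not taken from app.py: the function names (`map_weights`, `core_for_slot`, `pin_current_process`) and the slot-to-core policy are hypothetical, and the real engine may map weights through its inference backend instead.

```python
import mmap
import os


def map_weights(path: str) -> mmap.mmap:
    """Map a weights file read-only so the OS pages it in lazily."""
    with open(path, "rb") as f:
        # Length 0 maps the whole file; the mapping stays valid after f closes.
        return mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)


def core_for_slot(slot: int, n_cores: int = 2) -> int:
    """Hypothetical policy: user slot i is served by core i modulo n_cores."""
    return slot % n_cores


def pin_current_process(core: int) -> bool:
    """Pin the calling process to one core; return False if that is not possible."""
    if not hasattr(os, "sched_setaffinity"):
        return False  # affinity calls are not available on every platform
    if core not in os.sched_getaffinity(0):
        return False  # the requested core is not available to this process
    os.sched_setaffinity(0, {core})
    return True
```

With two vCPUs, `core_for_slot` alternates between core 0 and core 1, so a worker process serving a given user slot could call `pin_current_process(core_for_slot(slot))` at startup. Because the mapping is read-only and file-backed, the kernel can evict weight pages under memory pressure instead of swapping them, which is one reason mmap leaves more RAM free for context.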