466 GB
2,808 files
Updated 16 days ago
Name              Size
README.md         1.13 kB
eic_storage.py    28.5 kB
test_unit.py      3.6 kB
README.md

EIC as SGLang HiCache Storage

EIC (Elastic Instant Cache) is a distributed database designed for LLM KV cache. It supports RDMA and GDR (GPUDirect RDMA), and provides distributed disaster tolerance and capacity expansion. The principles and architecture of EIC are described in these articles: https://mp.weixin.qq.com/s/tasDqXf0Gxr3o_WCJ2IJUQ https://mp.weixin.qq.com/s/b_4YhTa96Zeklh23lv8qBw

Deploy EIC

Visit the official console at https://console.volcengine.com/eic to deploy EIC KVCache on your compute cluster through the web UI. In addition, we provide a dedicated image on Volcano Engine that integrates various optimizations on top of the official image. You can use test_unit.py to check connectivity to EIC.

Deploy Model With EIC

You can enable EIC KVCache offloading through the official SGLang launch interface, for example:

python -m sglang.launch_server \
    --model-path [model_path] \
    --enable-hierarchical-cache \
    --hicache-storage-backend eic \
    --hicache-write-policy 'write_through' \
    --hicache-mem-layout 'page_first'

For more details, see the documentation: https://www.volcengine.com/docs/85848/1749188
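
If you start the server from a script rather than a shell, the launch command above can be assembled as an argument list, which avoids line-continuation mistakes. The following is only a minimal sketch: the helper name build_launch_command and the placeholder model path are assumptions for illustration, and the flags are exactly the ones shown in the command above.

```python
import shlex
import sys


def build_launch_command(model_path, write_policy="write_through",
                         mem_layout="page_first"):
    """Return the argv list for launching SGLang with EIC as HiCache storage.

    Hypothetical helper: it only mirrors the flags from the README command.
    """
    return [
        sys.executable, "-m", "sglang.launch_server",
        "--model-path", model_path,
        "--enable-hierarchical-cache",
        "--hicache-storage-backend", "eic",
        "--hicache-write-policy", write_policy,
        "--hicache-mem-layout", mem_layout,
    ]


if __name__ == "__main__":
    # "/path/to/model" is a placeholder; substitute your own model path.
    cmd = build_launch_command("/path/to/model")
    # Print the shell-quoted command for inspection.
    print(shlex.join(cmd))
    # To actually start the server, pass the list to subprocess.run(cmd, check=True).
```

Passing the arguments as a list means no shell quoting or trailing backslashes are involved, so a missing continuation character cannot silently drop flags.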

Last updated
Jun 16