Buckets:

leideng
/

QCFuse

leideng/QCFuse / srt /mem_cache /storage /lmcache

466 GB

2,808 files

Updated 19 days ago

Ctrl+K

Name	Size	Uploaded	Xet hash
README.md	870 Bytes xet	19 days ago	b34cb91b
example_config.yaml	158 Bytes xet	19 days ago	8ec69b45
lmc_radix_cache.py	10 kB xet	19 days ago	77d729a3
unit_test.py	3.59 kB xet	19 days ago	ae4dd28b

README.md

LMCache Connector for SGLang

This document describes how to use LMCache as KV Cache Management Backend for SGLang engine. For more details about LMCache, please refer to: https://lmcache.ai

Install LMCache

Method 1: with pip

pip install lmcache

Method 2: from source

Clone LMCache project:

git clone https://github.com/LMCache/LMCache

Install:

cd LMCache
pip install -e . --no-build-isolation

Use LMCache

Firstly, setup LMCache config. An example config is set at example_config.yaml. For more settings please refer to https://docs.lmcache.ai/api_reference/configurations.html.

Secondly, setup SGLang serving engine with lmcache:

export LMCACHE_USE_EXPERIMENTAL=True
export LMCACHE_CONFIG_FILE=example_config.yaml

python -m sglang.launch_server \
  --model-path MODEL \
  --enable-lmcache

Total size: 466 GB

Files: 2,808

Last updated: Jun 16

Pre-warmed CDN: US EU US EU

LMCache Connector for SGLang

Install LMCache

Method 1: with pip

Method 2: from source

Use LMCache

Contributors