LMCache Connector for SGLang

This document describes how to use LMCache as the KV cache management backend for the SGLang engine. For more details about LMCache, see https://lmcache.ai

Install LMCache

Method 1: with pip

pip install lmcache

Method 2: from source

Clone the LMCache project:

git clone https://github.com/LMCache/LMCache

Install:

cd LMCache
pip install -e . --no-build-isolation

Use LMCache

First, set up the LMCache config. An example config is provided in example_config.yaml. For more settings, refer to https://docs.lmcache.ai/api_reference/configurations.html.
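If you want your own config instead of the bundled example, a minimal sketch follows. The file name my_lmcache_config.yaml and all values are illustrative assumptions, and they are not the contents of example_config.yaml. The keys (chunk_size, local_cpu, max_local_cpu_size) are recalled from the LMCache configuration reference linked above, so check them against that page for your installed version.

```shell
# Write a small LMCache config file.
# The file name and every value below are assumptions for illustration.
cat > my_lmcache_config.yaml <<'EOF'
# Number of tokens grouped into one KV cache chunk
chunk_size: 256
# Store KV cache chunks in CPU memory
local_cpu: true
# CPU memory budget for the cache, in GB
max_local_cpu_size: 5.0
EOF
```

To use this file, set LMCACHE_CONFIG_FILE to its path when launching the server.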

Second, start the SGLang serving engine with LMCache enabled:

export LMCACHE_USE_EXPERIMENTAL=True
export LMCACHE_CONFIG_FILE=example_config.yaml

python -m sglang.launch_server \
  --model-path MODEL \
  --enable-lmcache
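
Once the server is running, a quick smoke test is to send it a request. The sketch below assumes the server listens on SGLang's default port 30000 (change SGLANG_URL if you passed --port) and uses SGLang's native /generate endpoint. The helper name query_sglang is hypothetical and exists only in this example.

```shell
# Assumed server address; override by exporting SGLANG_URL beforehand.
SGLANG_URL="${SGLANG_URL:-http://localhost:30000}"

# query_sglang PROMPT
# Sends one prompt to the /generate endpoint and prints the JSON response.
# The prompt is pasted into the JSON body as-is, so it must not contain
# double quotes or backslashes.
query_sglang() {
  local prompt="$1"
  curl -s "$SGLANG_URL/generate" \
    -H "Content-Type: application/json" \
    -d "{\"text\": \"$prompt\", \"sampling_params\": {\"max_new_tokens\": 16, \"temperature\": 0}}"
}
```

For example, run `query_sglang "The capital of France is"` in the same shell. A JSON response containing generated text indicates the server started correctly with the LMCache settings above.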