Instructions for using inclusionAI/Ring-lite with libraries, inference providers, notebooks, and local apps.

Libraries

How to use inclusionAI/Ring-lite with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="inclusionAI/Ring-lite", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("inclusionAI/Ring-lite", trust_remote_code=True, dtype="auto")
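
The direct-load snippet stops once the weights are loaded. A minimal sketch of a full chat turn could look like the following, assuming the repository ships a tokenizer with a chat template (the pipeline example accepting chat messages suggests it does). The helper names `build_messages` and `generate_reply` and the `max_new_tokens` default are illustrative and do not come from the model card.

```python
def build_messages(prompt, system=None):
    """Return a chat-style message list (hypothetical helper)."""
    messages = []
    if system:
        messages.append({"role": "system", "content": system})
    messages.append({"role": "user", "content": prompt})
    return messages


def generate_reply(prompt, model_id="inclusionAI/Ring-lite", max_new_tokens=256):
    """Load the tokenizer and model, then generate one assistant reply."""
    # Imported lazily so build_messages works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, trust_remote_code=True, dtype="auto"
    )

    # Render the conversation with the tokenizer's chat template (assumed to
    # exist) and append the marker that asks for an assistant turn.
    inputs = tokenizer.apply_chat_template(
        build_messages(prompt),
        add_generation_prompt=True,
        return_dict=True,
        return_tensors="pt",
    ).to(model.device)

    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens so only the newly generated text is decoded.
    prompt_length = inputs["input_ids"].shape[-1]
    return tokenizer.decode(output_ids[0][prompt_length:], skip_special_tokens=True)
```

Calling `generate_reply("Who are you?")` downloads the weights on first use, so it is left out of the block itself.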

Local Apps

vLLM

How to use inclusionAI/Ring-lite with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "inclusionAI/Ring-lite"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "inclusionAI/Ring-lite",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
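
The same request can be issued from Python using only the standard library. The sketch below mirrors the curl call above; the function names and the `BASE_URL` constant are illustrative, and the address assumes the server is running with the default host and port shown above.

```python
import json
import urllib.request

# Address taken from the curl example; adjust if the server runs elsewhere.
BASE_URL = "http://localhost:8000/v1"


def build_chat_request(prompt, model="inclusionAI/Ring-lite", base_url=BASE_URL):
    """Build (but do not send) a POST request for /chat/completions."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_chat_request(request, timeout=60):
    """Send the request and return the decoded JSON response as a dict."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, `send_chat_request(build_chat_request("What is the capital of France?"))` returns the parsed response body.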

Use Docker

docker model run hf.co/inclusionAI/Ring-lite

SGLang

How to use inclusionAI/Ring-lite with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "inclusionAI/Ring-lite" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "inclusionAI/Ring-lite",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
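
The curl call returns a JSON document rather than plain text. As a rough sketch, the assistant's reply can be pulled out as shown below, assuming the server follows the OpenAI chat-completions response shape. `extract_reply` is a hypothetical helper, and `sample_body` is a made-up placeholder response, not real model output.

```python
import json


def extract_reply(response_body):
    """Return the assistant text from a chat-completions JSON response."""
    data = json.loads(response_body)
    choices = data.get("choices") or []
    if not choices:
        raise ValueError("response contains no choices")
    # The first choice carries the assistant message for a single-reply request.
    return choices[0]["message"]["content"]


# Placeholder body in the OpenAI chat-completions shape (illustrative only).
sample_body = json.dumps(
    {
        "object": "chat.completion",
        "model": "inclusionAI/Ring-lite",
        "choices": [
            {
                "index": 0,
                "message": {"role": "assistant", "content": "(assistant reply text)"},
                "finish_reason": "stop",
            }
        ],
    }
)

print(extract_reply(sample_body))
```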

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "inclusionAI/Ring-lite" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "inclusionAI/Ring-lite",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
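
A freshly started container needs time to download and load the weights, so a request sent immediately after `docker run` may be refused. One way to handle this is to poll the server before sending the first request. The sketch below assumes the server exposes the OpenAI-style `/v1/models` route; the function names, retry count, and delay are illustrative choices.

```python
import time
import urllib.error
import urllib.request


def server_is_up(base_url, timeout=5):
    """Return True if GET {base_url}/v1/models answers with HTTP 200."""
    try:
        with urllib.request.urlopen(f"{base_url}/v1/models", timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, reset, timed out, or a non-2xx status.
        return False


def wait_until_ready(base_url, probe=server_is_up, attempts=60, delay=5.0, sleep=time.sleep):
    """Call probe(base_url) up to `attempts` times, sleeping between tries."""
    for attempt in range(attempts):
        if probe(base_url):
            return True
        if attempt < attempts - 1:
            sleep(delay)
    return False
```

For the container above, `wait_until_ready("http://localhost:30000")` returns `True` once the server responds, or `False` after the attempts run out.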

Docker Model Runner
How to use inclusionAI/Ring-lite with Docker Model Runner:
```
docker model run hf.co/inclusionAI/Ring-lite
```

Ring-lite

Commit History

Adding the `transformers` tag to populate the "Use this model" tab for wider visibility and usage. (#1)
237e860 (verified), m1ngcheng, ariG23498 (HF Staff) committed on Aug 18, 2025

Update README.md
deb53f7 (verified), LiangJiang committed on Aug 4, 2025

Update tokenizer_config.json
b45db76 (verified), lemonpiece committed on Jul 9, 2025

Update README.md
f4cd357 (verified), LiangJiang committed on Jul 4, 2025

Update README.md
5edf18f (verified), LiangJiang committed on Jul 4, 2025

Update README.md
091736d (verified), LiangJiang committed on Jul 4, 2025

Upload performance.png
19a8458 (verified), LiangJiang committed on Jul 4, 2025

Delete model-00004-of-00004.safetensors
4bbf49b (verified), LiangJiang committed on Jul 4, 2025

Delete model-00003-of-00004.safetensors
247229a (verified), LiangJiang committed on Jul 4, 2025

Delete model-00002-of-00004.safetensors
ed40ad4 (verified), LiangJiang committed on Jul 4, 2025

Delete model-00001-of-00004.safetensors
1093735 (verified), LiangJiang committed on Jul 4, 2025

Add files using upload-large-folder tool
8f2e6a0 (verified), LiangJiang committed on Jul 4, 2025

Add files using upload-large-folder tool
26625f7 (verified), LiangJiang committed on Jul 4, 2025

Update README.md
c326308 (verified), LiangJiang committed on Jun 21, 2025

Update README.md
d8beba5 (verified), LiangJiang committed on Jun 21, 2025

Update README.md
9d76a87 (verified), LiangJiang committed on Jun 20, 2025

Update README.md
0e11f23 (verified), LiangJiang committed on Jun 20, 2025

Update README.md
51f3b16 (verified), LiangJiang committed on Jun 18, 2025

Update README.md
7cfc6f6 (verified), LiangJiang committed on Jun 18, 2025

Update README.md
8873993 (verified), LiangJiang committed on Jun 18, 2025

Update README.md
a668bb4 (verified), LiangJiang committed on Jun 18, 2025

Update README.md
4812a9f (verified), LiangJiang committed on Jun 18, 2025

Upload performance.png
7b510f5 (verified), LiangJiang committed on Jun 18, 2025

Delete performace.png
7b49d4c (verified), LiangJiang committed on Jun 18, 2025

Upload performace.png
fc90f84 (verified), LiangJiang committed on Jun 18, 2025

Update README.md
3ba64be (verified), LiangJiang committed on Jun 18, 2025

delete performace.pdf
428a664, LandyGuo committed on Jun 17, 2025