Instructions for using xiwenc1/OpenRS-DR_GRPO with libraries and local apps.

Libraries

How to use xiwenc1/OpenRS-DR_GRPO with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="xiwenc1/OpenRS-DR_GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
# Print the result so the output is visible when run as a script
print(pipe(messages))

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("xiwenc1/OpenRS-DR_GRPO")
model = AutoModelForCausalLM.from_pretrained("xiwenc1/OpenRS-DR_GRPO", device_map="auto")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
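
The final `print` in the second snippet slices the output at the prompt length because, for a causal language model, `generate` returns the prompt tokens followed by the newly generated ones. A minimal sketch of that slicing on plain Python lists follows; the token ids and the helper name `strip_prompt` are invented for illustration and are not real tokenizer output or part of Transformers:

```python
# Illustration only: these token ids are made up, not real tokenizer output.
def strip_prompt(output_ids, prompt_length):
    """Return only the tokens that were generated after the prompt."""
    return output_ids[prompt_length:]

prompt_ids = [101, 2040, 2024, 2017]          # hypothetical encoded chat prompt
output_ids = prompt_ids + [1045, 2572, 102]   # prompt echoed back, then new tokens
new_ids = strip_prompt(output_ids, len(prompt_ids))
print(new_ids)
```

In the snippet above, `inputs["input_ids"].shape[-1]` plays the role of `len(prompt_ids)`.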

Local Apps

vLLM

How to use xiwenc1/OpenRS-DR_GRPO with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "xiwenc1/OpenRS-DR_GRPO"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "xiwenc1/OpenRS-DR_GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
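
The same request can be sent from Python instead of curl. The sketch below uses only the standard library and assumes the server started above is listening on `localhost:8000`; the helper names `build_chat_request` and `chat` are illustrative, not part of vLLM:

```python
import json
import urllib.request

def build_chat_request(prompt, model="xiwenc1/OpenRS-DR_GRPO",
                       base_url="http://localhost:8000"):
    """Build a POST request for the OpenAI-compatible chat completions route."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def chat(prompt, **kwargs):
    """Send the request and return the first reply (needs a running server)."""
    with urllib.request.urlopen(build_chat_request(prompt, **kwargs)) as resp:
        body = json.load(resp)
    # OpenAI-style responses put the reply text under choices[0].message.content
    return body["choices"][0]["message"]["content"]

if __name__ == "__main__":
    # Building the request does not touch the network; chat() does.
    request = build_chat_request("What is the capital of France?")
    print(request.full_url)
```

Calling `chat("What is the capital of France?")` sends the request. For a server on a different port, such as the SGLang setup on port 30000, pass `base_url="http://localhost:30000"`.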

SGLang

How to use xiwenc1/OpenRS-DR_GRPO with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "xiwenc1/OpenRS-DR_GRPO" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "xiwenc1/OpenRS-DR_GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "xiwenc1/OpenRS-DR_GRPO" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "xiwenc1/OpenRS-DR_GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use xiwenc1/OpenRS-DR_GRPO with Docker Model Runner:
```
docker model run hf.co/xiwenc1/OpenRS-DR_GRPO
```

OpenRS-DR_GRPO

Commit History

End of training

7c22d12
verified

xiwenc1 committed on Apr 28, 2025

Model save

1eef637
verified

xiwenc1 committed on Apr 28, 2025

Training in progress, step 500

89609ac
verified

xiwenc1 committed on Apr 28, 2025

Training in progress, step 450

fe74e52
verified

xiwenc1 committed on Apr 28, 2025

Training in progress, step 400

5e26cfe
verified

xiwenc1 committed on Apr 28, 2025

Training in progress, step 350

5a34556
verified

xiwenc1 committed on Apr 28, 2025

Training in progress, step 300

0d932f4
verified

xiwenc1 committed on Apr 27, 2025

Training in progress, step 250

05c48a8
verified

xiwenc1 committed on Apr 27, 2025

Training in progress, step 200

bf5e69e
verified

xiwenc1 committed on Apr 27, 2025

Training in progress, step 150

27152e1
verified

xiwenc1 committed on Apr 27, 2025

Training in progress, step 100

88ca040
verified

xiwenc1 committed on Apr 27, 2025

Training in progress, step 50

41db806
verified

xiwenc1 committed on Apr 26, 2025

initial commit

6c0f5c8
verified

xiwenc1 committed on Apr 26, 2025
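
The "Training in progress" commit titles in this history encode the checkpoint step. As a rough sketch, the steps can be pulled out of the titles and their spacing checked; the titles are copied from the list above, while the regular expression and the helper name `checkpoint_steps` are illustrative choices rather than Hub tooling:

```python
import re

# Commit titles copied from the history above, newest first.
TITLES = [
    "End of training",
    "Model save",
    "Training in progress, step 500",
    "Training in progress, step 450",
    "Training in progress, step 400",
    "Training in progress, step 350",
    "Training in progress, step 300",
    "Training in progress, step 250",
    "Training in progress, step 200",
    "Training in progress, step 150",
    "Training in progress, step 100",
    "Training in progress, step 50",
    "initial commit",
]

STEP_PATTERN = re.compile(r"^Training in progress, step (\d+)$")

def checkpoint_steps(titles):
    """Return the checkpoint steps found in the commit titles, ascending."""
    steps = [int(m.group(1)) for t in titles if (m := STEP_PATTERN.match(t))]
    return sorted(steps)

steps = checkpoint_steps(TITLES)
# Set of distinct gaps between consecutive checkpoints
intervals = {later - earlier for earlier, later in zip(steps, steps[1:])}
print(steps)
print(intervals)
```

For this history, the checkpoints are spaced 50 steps apart, from step 50 to step 500.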
