Instructions for using arthurwangheng/OpenRS-GRPO with libraries and local apps.

Libraries

How to use arthurwangheng/OpenRS-GRPO with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="arthurwangheng/OpenRS-GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
# The call returns the result; print it when running as a script
print(pipe(messages))

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("arthurwangheng/OpenRS-GRPO")
model = AutoModelForCausalLM.from_pretrained("arthurwangheng/OpenRS-GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
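
The pipeline variant above returns a nested structure. As a minimal sketch, assuming the usual Transformers behaviour for chat-style input (a list with one dict whose `generated_text` holds the conversation with the assistant turn appended), the reply text can be pulled out as shown here. The helper name `last_reply` and the sample data are illustrative and not part of the model page.

```python
def last_reply(pipeline_output):
    """Return the assistant text from a chat-style text-generation result.

    Assumes the shape [{"generated_text": [{"role": ..., "content": ...}, ...]}];
    that shape is an assumption about the installed Transformers version.
    """
    conversation = pipeline_output[0]["generated_text"]
    # The newly generated assistant turn is the last message in the list.
    return conversation[-1]["content"]


# Hand-written stand-in for a real pipeline result (not actual model output).
sample_output = [
    {
        "generated_text": [
            {"role": "user", "content": "Who are you?"},
            {"role": "assistant", "content": "(model reply here)"},
        ]
    }
]
print(last_reply(sample_output))
```

With a real pipeline, `last_reply(pipe(messages))` would be the equivalent call.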

Local Apps

vLLM

How to use arthurwangheng/OpenRS-GRPO with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "arthurwangheng/OpenRS-GRPO"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "arthurwangheng/OpenRS-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
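
The curl call above can also be issued from Python. This is a minimal sketch using only the standard library, assuming the server started by `vllm serve` is listening on localhost:8000 as in the curl example. The helper names `build_chat_request` and `send_chat_request` are illustrative and not part of vLLM.

```python
import json
import urllib.request


def build_chat_request(prompt, model="arthurwangheng/OpenRS-GRPO",
                       base_url="http://localhost:8000"):
    """Build the same POST request as the curl command, without sending it."""
    payload = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_chat_request(request, timeout=60):
    """Send the request and return the decoded JSON body (needs a running server)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the request does not touch the network.
request = build_chat_request("What is the capital of France?")
print(request.full_url)
```

With the server running, `send_chat_request(request)` returns the parsed JSON response.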

Use Docker

docker model run hf.co/arthurwangheng/OpenRS-GRPO

SGLang

How to use arthurwangheng/OpenRS-GRPO with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "arthurwangheng/OpenRS-GRPO" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "arthurwangheng/OpenRS-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
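
The server's reply to the request above is JSON in the OpenAI chat completions layout. This is a minimal sketch of reading the answer text, assuming that layout (`choices[0].message.content`). The helper name `extract_answer` and the sample response are hand-written for illustration and do not show real model output.

```python
import json


def extract_answer(response_text):
    """Return the assistant message text from a chat completions JSON body."""
    body = json.loads(response_text)
    choices = body.get("choices", [])
    if not choices:
        raise ValueError("response contains no choices")
    # The first choice carries the assistant message for a single-answer request.
    return choices[0]["message"]["content"]


# Illustrative response body, trimmed to the fields used here.
sample_response = json.dumps({
    "model": "arthurwangheng/OpenRS-GRPO",
    "choices": [
        {"index": 0, "message": {"role": "assistant", "content": "(answer text)"}}
    ],
})
print(extract_answer(sample_response))
```

Raising on an empty `choices` list keeps a malformed or error response from surfacing later as an unrelated `IndexError`.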

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "arthurwangheng/OpenRS-GRPO" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "arthurwangheng/OpenRS-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use arthurwangheng/OpenRS-GRPO with Docker Model Runner:
```
docker model run hf.co/arthurwangheng/OpenRS-GRPO
```

OpenRS-GRPO

Commit History

End of training b916787 verified (arthurwangheng committed on Mar 27, 2025)

Model save 73ec496 verified (arthurwangheng committed on Mar 27, 2025)

Training in progress, step 500 3bed350 verified (arthurwangheng committed on Mar 27, 2025)

Training in progress, step 450 39f28ac verified (arthurwangheng committed on Mar 27, 2025)

Training in progress, step 400 9089194 verified (arthurwangheng committed on Mar 27, 2025)

Training in progress, step 350 0e78d3b verified (arthurwangheng committed on Mar 27, 2025)

Training in progress, step 300 dce4092 verified (arthurwangheng committed on Mar 27, 2025)

End of training 1d1487f verified (arthurwangheng committed on Mar 27, 2025)

Model save 0fa4119 verified (arthurwangheng committed on Mar 27, 2025)

Training in progress, step 251 30f75e7 verified (arthurwangheng committed on Mar 27, 2025)

Training in progress, step 250 8cd1967 verified (arthurwangheng committed on Mar 26, 2025)

Training in progress, step 200 1fb0e85 verified (arthurwangheng committed on Mar 26, 2025)

Training in progress, step 150 f03744d verified (arthurwangheng committed on Mar 26, 2025)

Training in progress, step 100 d184114 verified (arthurwangheng committed on Mar 26, 2025)

Training in progress, step 50 82424ea verified (arthurwangheng committed on Mar 26, 2025)

initial commit ad1d1e4 verified (arthurwangheng committed on Mar 26, 2025)
