Instructions to use yolay/Qwen2.5-1.5B-Open-R1-GRPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use yolay/Qwen2.5-1.5B-Open-R1-GRPO with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="yolay/Qwen2.5-1.5B-Open-R1-GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("yolay/Qwen2.5-1.5B-Open-R1-GRPO")
model = AutoModelForCausalLM.from_pretrained("yolay/Qwen2.5-1.5B-Open-R1-GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use yolay/Qwen2.5-1.5B-Open-R1-GRPO with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "yolay/Qwen2.5-1.5B-Open-R1-GRPO"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "yolay/Qwen2.5-1.5B-Open-R1-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/yolay/Qwen2.5-1.5B-Open-R1-GRPO

SGLang

How to use yolay/Qwen2.5-1.5B-Open-R1-GRPO with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "yolay/Qwen2.5-1.5B-Open-R1-GRPO" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "yolay/Qwen2.5-1.5B-Open-R1-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "yolay/Qwen2.5-1.5B-Open-R1-GRPO" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "yolay/Qwen2.5-1.5B-Open-R1-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use yolay/Qwen2.5-1.5B-Open-R1-GRPO with Docker Model Runner:
```
docker model run hf.co/yolay/Qwen2.5-1.5B-Open-R1-GRPO
```

Qwen2.5-1.5B-Open-R1-GRPO

Commit History

End of training

95cd941
verified

yolay commited on Feb 12, 2025

Model save

c9313e0
verified

yolay commited on Feb 12, 2025

Training in progress, step 646

1a900b4
verified

yolay commited on Feb 12, 2025

Training in progress, step 600

622f03d
verified

yolay commited on Feb 12, 2025

Training in progress, step 500

307b64b
verified

yolay commited on Feb 12, 2025

Training in progress, step 400

da7c85a
verified

yolay commited on Feb 11, 2025

Training in progress, step 300

5f8cfe2
verified

yolay commited on Feb 11, 2025

Training in progress, step 100

5bdec38
verified

yolay commited on Feb 11, 2025

initial commit

9a234da
verified

yolay commited on Feb 11, 2025

Commit History

End of training 95cd941 verified

Model save c9313e0 verified

Training in progress, step 646 1a900b4 verified

Training in progress, step 600 622f03d verified

Training in progress, step 500 307b64b verified

Training in progress, step 400 da7c85a verified

Training in progress, step 300 5f8cfe2 verified

Training in progress, step 100 5bdec38 verified

initial commit 9a234da verified

End of training

95cd941
verified

Model save

c9313e0
verified

Training in progress, step 646

1a900b4
verified

Training in progress, step 600

622f03d
verified

Training in progress, step 500

307b64b
verified

Training in progress, step 400

da7c85a
verified

Training in progress, step 300

5f8cfe2
verified

Training in progress, step 100

5bdec38
verified

initial commit

9a234da
verified