Instructions to use Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO")
model = AutoModelForCausalLM.from_pretrained("Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO", device_map="auto")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO

SGLang

How to use Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO with Docker Model Runner:
```
docker model run hf.co/Thomas-Chou/Qwen2.5-1.5B-Open-R1-GRPO
```

Qwen2.5-1.5B-Open-R1-GRPO

Commit History

End of training

e89fd66
verified

Thomas-Chou commited on Jan 9

Model save

e76c580
verified

Thomas-Chou commited on Jan 9

Training in progress, step 5600

eefe861
verified

Thomas-Chou commited on Jan 8

Training in progress, step 4800

f0d1165
verified

Thomas-Chou commited on Jan 8

Training in progress, step 3200

b3f07c4
verified

Thomas-Chou commited on Jan 7

Training in progress, step 2400

287f65c
verified

Thomas-Chou commited on Jan 4

Training in progress, step 1600

588ce34
verified

Thomas-Chou commited on Jan 4

Training in progress, step 800

5635361
verified

Thomas-Chou commited on Jan 4

End of training

9fc151c
verified

Thomas-Chou commited on Jan 4

Model save

15191f6
verified

Thomas-Chou commited on Jan 4

Training in progress, step 2400

1cb3f62
verified

Thomas-Chou commited on Jan 3

Training in progress, step 1600

909a33e
verified

Thomas-Chou commited on Jan 3

Training in progress, step 800

1c7983a
verified

Thomas-Chou commited on Jan 3

End of training

71ef1a3
verified

Thomas-Chou commited on Feb 10, 2025

Model save

86cdb62
verified

Thomas-Chou commited on Feb 10, 2025

initial commit

d9bb8f5
verified

Thomas-Chou commited on Feb 10, 2025

Commit History

End of training e89fd66 verified

Model save e76c580 verified

Training in progress, step 5600 eefe861 verified

Training in progress, step 4800 f0d1165 verified

Training in progress, step 3200 b3f07c4 verified

Training in progress, step 2400 287f65c verified

Training in progress, step 1600 588ce34 verified

Training in progress, step 800 5635361 verified

End of training 9fc151c verified

Model save 15191f6 verified

Training in progress, step 2400 1cb3f62 verified

Training in progress, step 1600 909a33e verified

Training in progress, step 800 1c7983a verified

End of training 71ef1a3 verified

Model save 86cdb62 verified

initial commit d9bb8f5 verified

End of training

e89fd66
verified

Model save

e76c580
verified

Training in progress, step 5600

eefe861
verified

Training in progress, step 4800

f0d1165
verified

Training in progress, step 3200

b3f07c4
verified

Training in progress, step 2400

287f65c
verified

Training in progress, step 1600

588ce34
verified

Training in progress, step 800

5635361
verified

End of training

9fc151c
verified

Model save

15191f6
verified

Training in progress, step 2400

1cb3f62
verified

Training in progress, step 1600

909a33e
verified

Training in progress, step 800

1c7983a
verified

End of training

71ef1a3
verified

Model save

86cdb62
verified

initial commit

d9bb8f5
verified