Instructions to use May811/Qwen2.5-1.5B-Open-R1-GRPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use May811/Qwen2.5-1.5B-Open-R1-GRPO with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="May811/Qwen2.5-1.5B-Open-R1-GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("May811/Qwen2.5-1.5B-Open-R1-GRPO")
model = AutoModelForCausalLM.from_pretrained("May811/Qwen2.5-1.5B-Open-R1-GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use May811/Qwen2.5-1.5B-Open-R1-GRPO with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "May811/Qwen2.5-1.5B-Open-R1-GRPO"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "May811/Qwen2.5-1.5B-Open-R1-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/May811/Qwen2.5-1.5B-Open-R1-GRPO

SGLang

How to use May811/Qwen2.5-1.5B-Open-R1-GRPO with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "May811/Qwen2.5-1.5B-Open-R1-GRPO" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "May811/Qwen2.5-1.5B-Open-R1-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "May811/Qwen2.5-1.5B-Open-R1-GRPO" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "May811/Qwen2.5-1.5B-Open-R1-GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use May811/Qwen2.5-1.5B-Open-R1-GRPO with Docker Model Runner:
```
docker model run hf.co/May811/Qwen2.5-1.5B-Open-R1-GRPO
```

Qwen2.5-1.5B-Open-R1-GRPO

Commit History

Model save

9afce01
verified

May811 commited on Feb 15, 2025

Training in progress, step 1131

e52d49f
verified

May811 commited on Feb 15, 2025

Model save

e3b8fb1
verified

May811 commited on Feb 15, 2025

Training in progress, step 2263

5113781
verified

May811 commited on Feb 15, 2025

Training in progress, step 1000

7a36eb8
verified

May811 commited on Feb 14, 2025

Training in progress, step 2000

c7b67e4
verified

May811 commited on Feb 14, 2025

End of training

717ee10
verified

May811 commited on Feb 14, 2025

Model save

21707f8
verified

May811 commited on Feb 14, 2025

Training in progress, step 800

45aa460
verified

May811 commited on Feb 13, 2025

Training in progress, step 1600

e15e270
verified

May811 commited on Feb 13, 2025

Training in progress, step 600

b544b02
verified

May811 commited on Feb 12, 2025

Training in progress, step 1200

431ae23
verified

May811 commited on Feb 12, 2025

Training in progress, step 400

a4bfe97
verified

May811 commited on Feb 12, 2025

Training in progress, step 800

6c28f61
verified

May811 commited on Feb 11, 2025

Training in progress, step 400

e87b36e
verified

May811 commited on Feb 11, 2025

initial commit

df72350
verified

May811 commited on Feb 5, 2025

Commit History

Model save 9afce01 verified

Training in progress, step 1131 e52d49f verified

Model save e3b8fb1 verified

Training in progress, step 2263 5113781 verified

Training in progress, step 1000 7a36eb8 verified

Training in progress, step 2000 c7b67e4 verified

End of training 717ee10 verified

Model save 21707f8 verified

Training in progress, step 800 45aa460 verified

Training in progress, step 1600 e15e270 verified

Training in progress, step 600 b544b02 verified

Training in progress, step 1200 431ae23 verified

Training in progress, step 400 a4bfe97 verified

Training in progress, step 800 6c28f61 verified

Training in progress, step 400 e87b36e verified

initial commit df72350 verified

Model save

9afce01
verified

Training in progress, step 1131

e52d49f
verified

Model save

e3b8fb1
verified

Training in progress, step 2263

5113781
verified

Training in progress, step 1000

7a36eb8
verified

Training in progress, step 2000

c7b67e4
verified

End of training

717ee10
verified

Model save

21707f8
verified

Training in progress, step 800

45aa460
verified

Training in progress, step 1600

e15e270
verified

Training in progress, step 600

b544b02
verified

Training in progress, step 1200

431ae23
verified

Training in progress, step 400

a4bfe97
verified

Training in progress, step 800

6c28f61
verified

Training in progress, step 400

e87b36e
verified

initial commit

df72350
verified