Instructions to use Thrillcrazyer/QWEN7_GRPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Thrillcrazyer/QWEN7_GRPO with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Thrillcrazyer/QWEN7_GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Thrillcrazyer/QWEN7_GRPO")
model = AutoModelForCausalLM.from_pretrained("Thrillcrazyer/QWEN7_GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Inference
Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use Thrillcrazyer/QWEN7_GRPO with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Thrillcrazyer/QWEN7_GRPO"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Thrillcrazyer/QWEN7_GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/Thrillcrazyer/QWEN7_GRPO

SGLang

How to use Thrillcrazyer/QWEN7_GRPO with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Thrillcrazyer/QWEN7_GRPO" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Thrillcrazyer/QWEN7_GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Thrillcrazyer/QWEN7_GRPO" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Thrillcrazyer/QWEN7_GRPO",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use Thrillcrazyer/QWEN7_GRPO with Docker Model Runner:
```
docker model run hf.co/Thrillcrazyer/QWEN7_GRPO
```

QWEN7_GRPO

Commit History

End of training

0cbbbb3
verified

Thrillcrazyer commited on Nov 29, 2025

Training in progress, step 1000

82d6fbf
verified

Thrillcrazyer commited on Nov 29, 2025

Training in progress, step 900

f2b5bdd
verified

Thrillcrazyer commited on Nov 29, 2025

Training in progress, step 800

858654a
verified

Thrillcrazyer commited on Nov 29, 2025

Training in progress, step 700

21ccd38
verified

Thrillcrazyer commited on Nov 29, 2025

Training in progress, step 600

6323b89
verified

Thrillcrazyer commited on Nov 29, 2025

Training in progress, step 500

3a4b152
verified

Thrillcrazyer commited on Nov 29, 2025

Training in progress, step 400

cf550af
verified

Thrillcrazyer commited on Nov 29, 2025

Training in progress, step 300

4a48dea
verified

Thrillcrazyer commited on Nov 29, 2025

Training in progress, step 200

2f98b2a
verified

Thrillcrazyer commited on Nov 29, 2025

Training in progress, step 100

cf0cf2a
verified

Thrillcrazyer commited on Nov 29, 2025

initial commit

878b76f
verified

Thrillcrazyer commited on Nov 27, 2025

Commit History

End of training 0cbbbb3 verified

Training in progress, step 1000 82d6fbf verified

Training in progress, step 900 f2b5bdd verified

Training in progress, step 800 858654a verified

Training in progress, step 700 21ccd38 verified

Training in progress, step 600 6323b89 verified

Training in progress, step 500 3a4b152 verified

Training in progress, step 400 cf550af verified

Training in progress, step 300 4a48dea verified

Training in progress, step 200 2f98b2a verified

Training in progress, step 100 cf0cf2a verified

initial commit 878b76f verified

End of training

0cbbbb3
verified

Training in progress, step 1000

82d6fbf
verified

Training in progress, step 900

f2b5bdd
verified

Training in progress, step 800

858654a
verified

Training in progress, step 700

21ccd38
verified

Training in progress, step 600

6323b89
verified

Training in progress, step 500

3a4b152
verified

Training in progress, step 400

cf550af
verified

Training in progress, step 300

4a48dea
verified

Training in progress, step 200

2f98b2a
verified

Training in progress, step 100

cf0cf2a
verified

initial commit

878b76f
verified