Instructions to use Elliott/LUFFY-Qwen-Math-7B-Zero with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Elliott/LUFFY-Qwen-Math-7B-Zero with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Elliott/LUFFY-Qwen-Math-7B-Zero")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Elliott/LUFFY-Qwen-Math-7B-Zero")
model = AutoModelForCausalLM.from_pretrained("Elliott/LUFFY-Qwen-Math-7B-Zero")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Inference
Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use Elliott/LUFFY-Qwen-Math-7B-Zero with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Elliott/LUFFY-Qwen-Math-7B-Zero"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Elliott/LUFFY-Qwen-Math-7B-Zero",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/Elliott/LUFFY-Qwen-Math-7B-Zero

SGLang

How to use Elliott/LUFFY-Qwen-Math-7B-Zero with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Elliott/LUFFY-Qwen-Math-7B-Zero" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Elliott/LUFFY-Qwen-Math-7B-Zero",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Elliott/LUFFY-Qwen-Math-7B-Zero" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Elliott/LUFFY-Qwen-Math-7B-Zero",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use Elliott/LUFFY-Qwen-Math-7B-Zero with Docker Model Runner:
```
docker model run hf.co/Elliott/LUFFY-Qwen-Math-7B-Zero
```

Elliott commited on Apr 22, 2025

Commit

d1611e6

verified ·

1 Parent(s): 9d3e41a

Update README.md

Browse files

Files changed (1) hide show

README.md +11 -15

README.md CHANGED Viewed

@@ -1,13 +1,13 @@
----
-library_name: transformers
-tags:
-- reasoning
-- Zero-RL
-license: mit
-base_model:
-- Qwen/Qwen2.5-Math-7B
-pipeline_tag: text-generation
----
 # 📖Introduction
 ![Github](https://img.shields.io/badge/LUFFY-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white)
@@ -60,8 +60,6 @@ LUFFY is evaluated on six competition-level benchmarks, achieving state-of-the-a
 | OpenReasoner-Zero                 | 17.2        | 15.0        | 52.3    | 84.6          | 33.8        | 47.1          | 41.7     |
 | PRIME-Zero                        | 17.9        | 14.7        | 55.2    | 79.4          | **38.2**    | 42.2          | 41.3     |
 | Oat-Zero                          | **31.7**    | 11.0        | 61.6    | 79.2          | 29.8        | 42.5          | 42.6     |
-| SFT (Our repication)                               | 28.6        | **23.5**    | 59.0    | 86.0          | 37.5        | 51.1          | 47.6     |
-| On-Policy RL (Our repication)                      | 24.6        | 15.7        | 61.3    | 84.6          | 34.9        | 47.9          | 44.8     |
 | **LUFFY**                         | 29.5        | 23.2        | **66.1**| **88.4**      | 33.8        | **56.4**      | **49.6** |
 ---
@@ -77,10 +75,8 @@ LUFFY also generalizes well to out-of-distribution tasks, with over +6.2 average
 | Qwen2.5-Math-7B-Instruct         | 70.3      | 24.7             | 34.1         | 43.0     |
 | SimpleRL-Zero                    | 30.2      | 23.2             | 34.5         | 29.3     |
 | OpenReasoner-Zero                | 66.2      | 29.8             | 58.7         | 51.6     |
-| PRIME-Zero                       | **73.3**  | 18.2             | 32.7         | 41.4     |
 | Oat-Zero                         | 70.1      | 23.7             | 41.7         | 45.2     |
-| SFT (Our repication)             | 75.2      | 24.7             | 42.7         | 47.5     |
-| On-Policy RL (Our repication)    | **82.3**  | **40.4**         | _49.3_       | _57.3_   |
 | **LUFFY**                        | _80.5_    | _39.9_           | **53.0**     | **57.8** |
 ---

+---
+library_name: transformers
+tags:
+- reasoning
+- Zero-RL
+license: mit
+base_model:
+- Qwen/Qwen2.5-Math-7B
+pipeline_tag: text-generation
+---
 # 📖Introduction
 ![Github](https://img.shields.io/badge/LUFFY-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white)
 | OpenReasoner-Zero                 | 17.2        | 15.0        | 52.3    | 84.6          | 33.8        | 47.1          | 41.7     |
 | PRIME-Zero                        | 17.9        | 14.7        | 55.2    | 79.4          | **38.2**    | 42.2          | 41.3     |
 | Oat-Zero                          | **31.7**    | 11.0        | 61.6    | 79.2          | 29.8        | 42.5          | 42.6     |
 | **LUFFY**                         | 29.5        | 23.2        | **66.1**| **88.4**      | 33.8        | **56.4**      | **49.6** |
 ---
 | Qwen2.5-Math-7B-Instruct         | 70.3      | 24.7             | 34.1         | 43.0     |
 | SimpleRL-Zero                    | 30.2      | 23.2             | 34.5         | 29.3     |
 | OpenReasoner-Zero                | 66.2      | 29.8             | 58.7         | 51.6     |
+| PRIME-Zero                       | 73.3      | 18.2             | 32.7         | 41.4     |
 | Oat-Zero                         | 70.1      | 23.7             | 41.7         | 45.2     |
 | **LUFFY**                        | _80.5_    | _39.9_           | **53.0**     | **57.8** |
 ---