Qwen3-R1 4B 🚀

-GGUF version https://huggingface.co/Ali-Yaser/Qwen3-R1-4B-gguf

Model Size License Base Model

Model Description

Qwen3-R1 Series is a specialized math and reansoning awnsers-focused fine-tuned version of Qwen3-8B Instruct, optimized for Math and hard question tasks.

📊 Model Details

  • Developed by: Ali-Yaser
  • Model type: GRPO thinker
  • Base Model: Qwen/Qwen3-8B
  • Model Size: 4B parameters
  • License: Apache 2.0
  • Language(s): English
  • Finetuned from: Qwen3-4B

🚀 Quick Start

Installation

I use a vLLM

# Install vLLM from pip:
pip install vllm

and lets download the model and run model


# Load and run the model:
vllm serve "Ali-Yaser/Qwen3-R1-4B"

and Run it this is example #

# Call the server using curl:
curl -X POST "http://localhost:8000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "Ali-Yaser/Qwen3-R1-4B",
        "messages": [
            {
                "role": "user",
                "content": "1+434x434+10x22=?"
            }
        ]
    }'
Downloads last month
-
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Ali-Yaser/Qwen3-R1-4B

Base model

Qwen/Qwen3-4B-Base
Finetuned
Qwen/Qwen3-4B
Finetuned
(408)
this model
Quantizations
1 model