Qwen3-R1 8B 🚀

Model Size License Base Model

Model Description

Qwen3-R1 Series is a specialized math and reansoning awnsers-focused fine-tuned version of Qwen3-8B Instruct, optimized for Math and hard question tasks.

📊 Model Details

  • Developed by: Ali-Yaser
  • Model type: GRPO thinker
  • Base Model: Qwen/Qwen3-8B
  • Model Size: 8B parameters
  • License: Apache 2.0
  • Language(s): English
  • Finetuned from: Qwen3-8B

🚀 Quick Start

Installation

I use a vLLM

# Install vLLM from pip:
pip install vllm

and lets download the model and run model


# Load and run the model:
vllm serve "Ali-Yaser/Qwen3-R1-8B"

and Run it this is example #

# Call the server using curl:
curl -X POST "http://localhost:8000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "Ali-Yaser/Qwen3-R1-8B",
        "messages": [
            {
                "role": "user",
                "content": "1+434334434+10x22=?"
            }
        ]
    }'
Downloads last month
45
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Ali-Yaser/Qwen3-R1-8B

Base model

Qwen/Qwen3-8B-Base
Finetuned
Qwen/Qwen3-8B
Finetuned
(868)
this model