How to use YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound with Transformers, vLLM, SGLang, and Docker Model Runner.

Libraries

How to use YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound")
model = AutoModelForCausalLM.from_pretrained("YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
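
The slice in the last line drops the prompt tokens so that only the newly generated text is decoded. Below is a minimal sketch of that indexing on plain Python lists. The token ids are made up and `strip_prompt` is a hypothetical helper name; the sketch assumes the output sequence is the prompt followed by the continuation, which is what the slice in the snippet above relies on.

```python
def strip_prompt(output_ids, prompt_len):
    """Return only the tokens that follow the prompt.

    Mirrors outputs[0][inputs["input_ids"].shape[-1]:] from the snippet above,
    using plain lists instead of tensors.
    """
    return output_ids[prompt_len:]


# Made-up token ids, for illustration only.
prompt_ids = [101, 102, 103]
output_ids = prompt_ids + [7, 8, 9]  # prompt followed by the new tokens

new_tokens = strip_prompt(output_ids, len(prompt_ids))
print(new_tokens)
```

Without the slice, the decoded string would start with the chat-template text of the prompt itself.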

Local Apps

vLLM

How to use YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
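
The same request can be built from Python with only the standard library. This is a sketch rather than an official client: `build_chat_request` and `send_chat_request` are hypothetical helper names, and the default base URL simply reuses the `localhost:8000` address from the curl command above.

```python
import json
import urllib.request

MODEL_ID = "YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound"


def build_chat_request(prompt, base_url="http://localhost:8000", model=MODEL_ID):
    """Build the same POST request as the curl command, without sending it."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_chat_request(request, timeout=60):
    """Send the request and return the parsed JSON body (needs a running server)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the request does not touch the network.
request = build_chat_request("What is the capital of France?")
print(request.full_url)
```

With the server running, `send_chat_request(request)` performs the call. Passing `base_url="http://localhost:30000"` targets the SGLang server described in the next section instead.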

Use Docker

docker model run hf.co/YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound

SGLang

How to use YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
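
Because the endpoint is OpenAI-compatible, the assistant's text should sit under `choices[0].message.content` in the JSON response. The sketch below pulls it out of an already parsed response. `extract_reply` is a hypothetical helper, and the sample dictionary is hand-written to match the OpenAI chat-completions shape; it is not real output from this model.

```python
def extract_reply(response):
    """Return the assistant text from an OpenAI-style chat completion dict."""
    choices = response.get("choices") or []
    if not choices:
        raise ValueError("response contains no choices")
    return choices[0]["message"]["content"]


# Hand-written sample in the OpenAI chat-completions shape; not real model output.
sample_response = {
    "choices": [
        {
            "index": 0,
            "message": {"role": "assistant", "content": "Paris."},
            "finish_reason": "stop",
        }
    ]
}

print(extract_reply(sample_response))
```

Raising on an empty `choices` list keeps a malformed or error response from surfacing later as an unrelated `IndexError`.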

Docker Model Runner
How to use YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound with Docker Model Runner:
```
docker model run hf.co/YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound
```

Qwen3-Coder-Next-int2-mixed-AutoRound

Commit History

Update README_zh.md 849e8d3 verified (YCWTG committed on Apr 1)

Update README.md e0890c4 verified (YCWTG committed on Apr 1)

Update README_zh.md b2ac362 verified (YCWTG committed on Mar 29)

Update README.md b4f347f verified (YCWTG committed on Mar 29)

Update README_zh.md 74f4c6d verified (YCWTG committed on Mar 12)

Update README.md bb94ccf verified (YCWTG committed on Mar 11)

Update README.md 66b17e6 verified (YCWTG committed on Mar 11)

Delete qwen3coder_tool_parser_vllm.py ca3ea92 verified (YCWTG committed on Mar 11)

Delete qwen3_coder_detector_sgl.py a071b48 verified (YCWTG committed on Mar 11)

Update README_zh.md ebbfa22 verified (YCWTG committed on Feb 27)

Update README.md 6804588 verified (YCWTG committed on Feb 27)

Update README.md cc43e88 verified (YCWTG committed on Feb 27)

Update README_zh.md 6cab9f6 verified (YCWTG committed on Feb 27)

Update README_zh.md d72b289 verified (YCWTG committed on Feb 27)

Update README_zh.md 5f21433 verified (YCWTG committed on Feb 27)

Update README_zh.md 0ae0d80 verified (YCWTG committed on Feb 27)

Update README_zh.md c8b89d3 verified (YCWTG committed on Feb 27)

Update README.md e5afd06 verified (YCWTG committed on Feb 27)

Update README_zh.md 9364f6c verified (YCWTG committed on Feb 26)

Update README.md 3ca2605 verified (YCWTG committed on Feb 26)

Update README_zh.md 8f4f6bd verified (YCWTG committed on Feb 26)

Update README.md 347d094 verified (YCWTG committed on Feb 26)

Update README.md 3fb42bc verified (YCWTG committed on Feb 26)

Update README_zh.md cfc1d49 verified (YCWTG committed on Feb 26)

Update README.md fc3d512 verified (YCWTG committed on Feb 25)

push all model 11bca80 (YCWTG committed on Feb 25)

Upload 9 files 45927de verified (YCWTG committed on Feb 25)

Update README.md e51c9a8 verified (YCWTG committed on Feb 25)

Rename readme_zh.txt to README_zh.md 82bbf65 verified (YCWTG committed on Feb 25)

Rename readme.txt to README.md 4714665 verified (YCWTG committed on Feb 25)

Delete README.md 76b68e5 verified (YCWTG committed on Feb 25)

Upload 2 files d446beb verified (YCWTG committed on Feb 25)

initial commit f224656 verified (YCWTG committed on Feb 24)
