Instructions for using techhermit/qwen35-slice14b-base with libraries, inference providers, notebooks, and local apps.

Libraries

How to use techhermit/qwen35-slice14b-base with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="techhermit/qwen35-slice14b-base")
messages = [
    {"role": "user", "content": "Who are you?"},
]
# Print the result; a bare call discards it when run as a script
print(pipe(messages))

# Load model directly
from transformers import AutoProcessor, AutoModelForCausalLM

processor = AutoProcessor.from_pretrained("techhermit/qwen35-slice14b-base")
model = AutoModelForCausalLM.from_pretrained("techhermit/qwen35-slice14b-base")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Local Apps

vLLM

How to use techhermit/qwen35-slice14b-base with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "techhermit/qwen35-slice14b-base"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "techhermit/qwen35-slice14b-base",
        "messages": [
            {
                "role": "user",
                "content": "What is the capital of France?"
            }
        ]
    }'
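
The same request can also be sent from Python. The following is a minimal sketch using only the standard library, assuming the server started above is listening on localhost:8000. The helper names `build_chat_payload` and `chat` are illustrative and not part of vLLM.

```python
import json
import urllib.request

MODEL_ID = "techhermit/qwen35-slice14b-base"


def build_chat_payload(prompt: str, model: str = MODEL_ID) -> dict:
    """Build the same JSON body that the curl example sends."""
    return {"model": model, "messages": [{"role": "user", "content": prompt}]}


def chat(prompt: str, base_url: str = "http://localhost:8000") -> str:
    """POST the payload to /v1/chat/completions and return the reply text."""
    request = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(build_chat_payload(prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    # OpenAI-compatible servers return the reply under choices[0].message.content.
    return body["choices"][0]["message"]["content"]
```

With the server running, `print(chat("What is the capital of France?"))` should print the model's reply. Passing `base_url="http://localhost:30000"` would target a server on port 30000 instead.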

Use Docker

docker model run hf.co/techhermit/qwen35-slice14b-base

SGLang

How to use techhermit/qwen35-slice14b-base with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "techhermit/qwen35-slice14b-base" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "techhermit/qwen35-slice14b-base",
        "messages": [
            {
                "role": "user",
                "content": "What is the capital of France?"
            }
        ]
    }'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "techhermit/qwen35-slice14b-base" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "techhermit/qwen35-slice14b-base",
        "messages": [
            {
                "role": "user",
                "content": "What is the capital of France?"
            }
        ]
    }'

Docker Model Runner

How to use techhermit/qwen35-slice14b-base with Docker Model Runner:

```
docker model run hf.co/techhermit/qwen35-slice14b-base
```

qwen35-slice14b-base

{
  "source_layers": 64,
  "target_layers": 32,
  "selected_layers": [
    0,
    2,
    4,
    6,
    8,
    10,
    12,
    14,
    16,
    18,
    20,
    22,
    24,
    26,
    28,
    30,
    33,
    35,
    37,
    39,
    41,
    43,
    45,
    47,
    49,
    51,
    53,
    55,
    57,
    59,
    61,
    63
  ],
  "layer_map": {
    "0": 0,
    "2": 1,
    "4": 2,
    "6": 3,
    "8": 4,
    "10": 5,
    "12": 6,
    "14": 7,
    "16": 8,
    "18": 9,
    "20": 10,
    "22": 11,
    "24": 12,
    "26": 13,
    "28": 14,
    "30": 15,
    "33": 16,
    "35": 17,
    "37": 18,
    "39": 19,
    "41": 20,
    "43": 21,
    "45": 22,
    "47": 23,
    "49": 24,
    "51": 25,
    "53": 26,
    "55": 27,
    "57": 28,
    "59": 29,
    "61": 30,
    "63": 31
  },
  "target_to_source_map": {
    "0": 0,
    "1": 2,
    "2": 4,
    "3": 6,
    "4": 8,
    "5": 10,
    "6": 12,
    "7": 14,
    "8": 16,
    "9": 18,
    "10": 20,
    "11": 22,
    "12": 24,
    "13": 26,
    "14": 28,
    "15": 30,
    "16": 33,
    "17": 35,
    "18": 37,
    "19": 39,
    "20": 41,
    "21": 43,
    "22": 45,
    "23": 47,
    "24": 49,
    "25": 51,
    "26": 53,
    "27": 55,
    "28": 57,
    "29": 59,
    "30": 61,
    "31": 63
  }
}
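
The file records the same selection three ways: an ordered list, a source-to-target map, and its inverse. Note that the kept indices are even up to layer 30 and odd from layer 33 onward, so both layer 0 and layer 63 are retained. The sketch below checks that the three tables agree. It is illustrative only: `check_slice_config` is a hypothetical helper, not part of the repository, and the tables are rebuilt in Python from the values listed above rather than read from disk.

```python
# Values copied from the JSON above.
source_layers = 64
target_layers = 32
# Even layers 0..30 followed by odd layers 33..63, as in "selected_layers".
selected_layers = list(range(0, 32, 2)) + list(range(33, 64, 2))

# "layer_map": source index -> target index (JSON object keys are strings).
layer_map = {str(src): tgt for tgt, src in enumerate(selected_layers)}
# "target_to_source_map": target index -> source index.
target_to_source_map = {str(tgt): src for tgt, src in enumerate(selected_layers)}


def check_slice_config(selected, fwd, inv, n_source, n_target):
    """Return True when the list and both maps describe the same selection."""
    return all([
        len(selected) == n_target,
        selected == sorted(set(selected)),  # strictly increasing, no repeats
        all(0 <= s < n_source for s in selected),
        len(fwd) == n_target and len(inv) == n_target,
        all(fwd.get(str(s)) == t for t, s in enumerate(selected)),
        all(inv.get(str(t)) == s for t, s in enumerate(selected)),
    ])


print(check_slice_config(
    selected_layers, layer_map, target_to_source_map, source_layers, target_layers
))
```

To check the real file instead, the three tables could be loaded with `json.load` and passed to the same function.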