---
license: apache-2.0
base_model:
- Qwen/Qwen3-14B
---
# Nano-Raccoon-Preview-1104
Prototyping checkpoint for NeAR-specialized SLM. Deployment friendly to single consumer GPU.
This model is a light SFT version from Qwen/Qwen3-14B, aimed at stable generative behavior on NeAR agent scaffold.
## Serve with vllm
**Single GPU**
```
vllm serve billxbf/Nano-Raccoon-Preview-1104 \
--trust-remote-code \
--host 0.0.0.0 \
--port 8000
```
**Use Tensor Parallel on 8xGPU**
```
vllm serve billxbf/Nano-Raccoon-Preview-1104 \
--tensor-parallel-size 8 \
--trust-remote-code \
--host 0.0.0.0 \
--port 8000
```