Update README.md
--- a/README.md
+++ b/README.md
@@ -88,6 +88,8 @@ print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True
 We recommend deploying the model with the lastest version of [vLLM](https://github.com/vllm-project/vllm).
 
 ```bash
+wget https://huggingface.co/trillionlabs/nemotron-slide/blob/main/nano_v3_reasoning_parser.py
+
 vllm serve trillionlabs/nemotron-slide \
 --tensor-parallel-size 1 \
 --port 8000 \
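A note on the added `wget` line: on Hugging Face, a `/blob/` URL points at the file's web page, while the raw file is served under `/resolve/`. The sketch below is not part of the commit. It shows one way to derive the raw-file URL before downloading, and the helper name `hf_raw_url` is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the commit): rewrite the first "/blob/"
# path segment of a Hugging Face file URL to "/resolve/", which is the
# path that serves the raw file instead of the HTML viewer page.
hf_raw_url() {
  printf '%s\n' "$1" | sed 's#/blob/#/resolve/#'
}

url="https://huggingface.co/trillionlabs/nemotron-slide/blob/main/nano_v3_reasoning_parser.py"
raw_url="$(hf_raw_url "$url")"
echo "$raw_url"

# To download the parser with the rewritten URL, uncomment:
# wget "$raw_url"
```

After downloading, opening the first few lines of `nano_v3_reasoning_parser.py` is a quick way to confirm it contains Python source and not an HTML page.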