Update README.md
README.md CHANGED

@@ -79,23 +79,9 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 print(response)
 ```
 
-### vLLM Deployment
-
-```bash
-vllm serve trillionlabs/Tri-21B-Think-Preview \
-    --dtype bfloat16 \
-    --max-model-len 32768 \
-    --tensor-parallel-size 8 \
-    --reasoning-parser qwen3 \
-    --enable-auto-tool-choice \
-    --tool-call-parser hermes
-```
-
-### SGLang Deployment
-
-```bash
-python3 -m sglang.launch_server --model-path trillionlabs/Tri-21B-Think-Preview --dtype bfloat16 --context-length 32768
-```
+### vLLM & SGLang Deployment
+
+vLLM and SGLang support for Trillion Model is on the way. Stay tuned!
 
 
 ## Fine-tuning Notes