Dense Qwen3
Collection
Dense Qwen3 models quantized and tuned for Trillim • 6 items • Updated
Qwen3-8B-TRNQ is a Trillim-packaged, requantized build of Qwen/Qwen3-8B, intended for efficient text generation with the Trillim inference engine.
| Item | Value |
|---|---|
| Architecture | Qwen3-8B dense |
| Parameters | 8B class |
| Source model | Qwen/Qwen3-8B |
| Packaging | Trillim requantized bundle |
| License | Apache 2.0 |
pip install trillim
trillim pull Trillim/Qwen3-8B-TRNQ
trillim chat Trillim/Qwen3-8B-TRNQ
This launches an interactive CLI chat session.
| File | Description |
|---|---|
qmodel.tensors |
Quantized weights in Trillim format |
rope.cache |
Precomputed RoPE cache for runtime |
config.json |
Model configuration |
generation_config.json |
Generation defaults |
trillim_config.json |
Trillim runtime metadata |
tokenizer.json |
Tokenizer data |
tokenizer_config.json |
Tokenizer configuration |
vocab.json, merges.txt |
Tokenizer assets |
This repository is derived from:
Changes made by Trillim:
This repository is not affiliated with or endorsed by Alibaba Cloud.
Released under the Apache 2.0 License, consistent with the upstream model license.
See:
LICENSE