Qwen3-8B-TRNQ

Qwen3-8B-TRNQ is a Trillim-packaged, requantized build of Qwen/Qwen3-8B, intended for efficient text generation with the Trillim inference engine.

Overview

Based on Qwen’s Qwen3-8B release
Requantized and packaged for Trillim runtime
Includes tokenizer, chat template, and Trillim-specific runtime artifacts
Distributed under the upstream Apache 2.0 license with attribution preserved

Model Details

Item	Value
Architecture	Qwen3-8B dense
Parameters	8B class
Source model	`Qwen/Qwen3-8B`
Packaging	Trillim requantized bundle
License	Apache 2.0

Usage

pip install trillim
trillim pull Trillim/Qwen3-8B-TRNQ
trillim chat Trillim/Qwen3-8B-TRNQ

This launches an interactive CLI chat session.

Repository Contents

File	Description
`qmodel.tensors`	Quantized weights in Trillim format
`rope.cache`	Precomputed RoPE cache for runtime
`config.json`	Model configuration
`generation_config.json`	Generation defaults
`trillim_config.json`	Trillim runtime metadata
`tokenizer.json`	Tokenizer data
`tokenizer_config.json`	Tokenizer configuration
`vocab.json`, `merges.txt`	Tokenizer assets