Qwen3-8B-TRNQ

Qwen3-8B-TRNQ is a Trillim-packaged, requantized build of Qwen/Qwen3-8B, intended for efficient text generation with the Trillim inference engine.

Overview

  • Based on Qwen’s Qwen3-8B release
  • Requantized and packaged for Trillim runtime
  • Includes tokenizer, chat template, and Trillim-specific runtime artifacts
  • Distributed under the upstream Apache 2.0 license with attribution preserved

Model Details

Item Value
Architecture Qwen3-8B dense
Parameters 8B class
Source model Qwen/Qwen3-8B
Packaging Trillim requantized bundle
License Apache 2.0

Usage

pip install trillim
trillim pull Trillim/Qwen3-8B-TRNQ
trillim chat Trillim/Qwen3-8B-TRNQ

This launches an interactive CLI chat session.

Repository Contents

File Description
qmodel.tensors Quantized weights in Trillim format
rope.cache Precomputed RoPE cache for runtime
config.json Model configuration
generation_config.json Generation defaults
trillim_config.json Trillim runtime metadata
tokenizer.json Tokenizer data
tokenizer_config.json Tokenizer configuration
vocab.json, merges.txt Tokenizer assets

Provenance

This repository is derived from:

Changes made by Trillim:

  • Requantized from the upstream release
  • Repackaged for the Trillim inference engine

This repository is not affiliated with or endorsed by Alibaba Cloud.

License

Released under the Apache 2.0 License, consistent with the upstream model license.

See:

  • LICENSE
Downloads last month
35
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Trillim/Qwen3-8B-TRNQ

Finetuned
Qwen/Qwen3-8B
Finetuned
(1641)
this model

Collection including Trillim/Qwen3-8B-TRNQ