Upload LlamaForCausalLM
- README.md +0 -2
- config.json +1 -1
README.md CHANGED

```diff
@@ -140,8 +140,6 @@ extra_gated_heading: Please be sure to provide your full legal name, date of bir
 
 A 4-bit quantized version of **Tri-21B**
 
-https://huggingface.co/trillionlabs/Tri-21B
-
 We introduce **Tri-21B**, our flagship large language model that redefines the efficiency frontier in LLM training. By achieving state-of-the-art performance with only 2.3T training tokens, we demonstrate that exceptional capabilities don't require excessive computational resources.
 
 <p align="center">
```
config.json CHANGED

```diff
@@ -23,7 +23,7 @@
   "quantization_config": {
     "_load_in_4bit": true,
     "_load_in_8bit": false,
-    "bnb_4bit_compute_dtype": "
+    "bnb_4bit_compute_dtype": "float32",
     "bnb_4bit_quant_storage": "uint8",
     "bnb_4bit_quant_type": "nf4",
     "bnb_4bit_use_double_quant": false,
```