Update README.md
README.md CHANGED
@@ -21,7 +21,7 @@ quantized_by: NightForger
 It is just a fast GPTQ 4-bit version of [this model](https://huggingface.co/IlyaGusev/saiga_nemo_12b).
 
 # Quantize config:
-```
+```json
 {
 "bits": 4,
 "group_size": 128,
@@ -41,7 +41,7 @@ It is just a fast GPTQ 4-bit version of [this model](https://huggingface.co/IlyaGusev/saiga_nemo_12b).
 1024 examples from [SFT set](https://huggingface.co/datasets/IlyaGusev/saiga_scored).
 
 # Code example (roleplay):
-```
+```python
 # Please don't use this code (try vLLM or ExLlama instead)
 import torch
 from transformers import AutoTokenizer, AutoConfig, GenerationConfig
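For reference, a quantize config with `"bits": 4` and `"group_size": 128` is the standard GPTQ layout, and the 1024 calibration examples mentioned in the second hunk are what the quantizer consumes. The sketch below shows how such a setup is typically driven with the auto-gptq library; the calibration handling, the output directory, and anything beyond the two config fields visible in the diff are assumptions, not the actual script behind this repo.

```python
# Hypothetical sketch (not the repo's actual script): quantizing the source
# model with a GPTQ config matching the fields visible in the diff.
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig
from transformers import AutoTokenizer

base_model = "IlyaGusev/saiga_nemo_12b"  # source model named in the README

quantize_config = BaseQuantizeConfig(
    bits=4,          # "bits": 4 from the hunk above
    group_size=128,  # "group_size": 128 from the hunk above
)

tokenizer = AutoTokenizer.from_pretrained(base_model)

# The README mentions 1024 examples from IlyaGusev/saiga_scored as calibration
# data; this placeholder list stands in for that set.
calibration_texts = ["placeholder calibration example"]
examples = [
    {"input_ids": enc["input_ids"], "attention_mask": enc["attention_mask"]}
    for enc in (tokenizer(t, return_tensors="pt") for t in calibration_texts)
]

model = AutoGPTQForCausalLM.from_pretrained(base_model, quantize_config)
model.quantize(examples)                          # GPTQ calibration pass
model.save_quantized("saiga_nemo_12b-gptq-4bit")  # output path is an assumption
tokenizer.save_pretrained("saiga_nemo_12b-gptq-4bit")
```

A group size of 128 is the usual middle ground for 4-bit GPTQ: smaller groups track the original weights more closely, larger ones save a little memory.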
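The comment in the second hunk points readers at vLLM or ExLlama rather than the raw transformers snippet. Below is a minimal vLLM sketch for a GPTQ checkpoint like this one; the repo id is a placeholder, and the chat messages and sampling settings are assumptions.

```python
# Hypothetical sketch: running a GPTQ 4-bit checkpoint with vLLM, as the
# README comment suggests. The repo id and sampling values are placeholders.
from transformers import AutoTokenizer
from vllm import LLM, SamplingParams

model_id = "NightForger/saiga_nemo_12b-gptq"  # placeholder id for this quant

tokenizer = AutoTokenizer.from_pretrained(model_id)
llm = LLM(model=model_id, quantization="gptq", dtype="float16")

messages = [
    {"role": "system", "content": "You are a friendly roleplay partner."},
    {"role": "user", "content": "Hi! Who are you?"},
]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

params = SamplingParams(temperature=0.7, top_p=0.9, max_tokens=256)
outputs = llm.generate([prompt], params)
print(outputs[0].outputs[0].text)
```

Recent vLLM versions also detect the GPTQ config from the checkpoint itself, so the explicit `quantization="gptq"` argument mostly acts as a guard.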