JosephusCheung committed
Commit f1f8b2c · 1 parent: 2a4a0cb

Update README.md

Files changed (1)
  1. README.md +8 -0
README.md CHANGED
@@ -45,6 +45,10 @@ GPT2Tokenizer fixed by [Kerfuffle](https://github.com/KerfuffleV2) on [https://g
 
 Thanks TheBloke for GGUF quants: [https://huggingface.co/TheBloke/CausalLM-7B-GGUF](https://huggingface.co/TheBloke/CausalLM-7B-GGUF)
 
+**Caution:** Unofficial GPTQ and AWQ quants may have quality issues, because they are calibrated on Wikitext while this model has undergone considerable training on a synthesized Wikipedia conversation dataset.
+
+Quantization of any kind is not recommended; prefer the smaller unquantized models instead, as the 7B and 14B versions are highly consistent. If you do quantize, use GGUF.
+
 ## Read Me:
 
 Also see [14B Version](https://huggingface.co/CausalLM/14B)
@@ -115,6 +119,10 @@ GPT2Tokenizer support fixed by [Kerfuffle](https://github.com/KerfuffleV2) on [h
 
 Thanks to TheBloke for the GGUF quantized models: [https://huggingface.co/TheBloke/CausalLM-7B-GGUF](https://huggingface.co/TheBloke/CausalLM-7B-GGUF)
 
+**Caution:** Unofficial GPTQ and AWQ models may have issues, because they are calibrated on Wikitext while this model has been trained extensively on a synthesized Wikipedia conversation dataset.
+
+Using any form of quantization is not recommended; use the smaller models instead, as the 7B and 14B versions are highly consistent. However, if you do quantize the model, use GGUF.
+
 ## Read Me:
 
 Also see the [14B Version](https://huggingface.co/CausalLM/14B)
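The added caution steers users toward the GGUF quants. A minimal sketch of consuming them with `llama-cpp-python` follows; note the assumptions: the package must be installed separately, and the `.gguf` file name below is an assumed example of TheBloke's usual naming, not something stated in this README (check the repository's file list for the real names).

```python
# Minimal sketch: run a GGUF quant of CausalLM-7B with llama-cpp-python.
# MODEL_PATH is an assumed example file name; see the files list at
# https://huggingface.co/TheBloke/CausalLM-7B-GGUF for the actual names.
import os

MODEL_PATH = "causallm_7b.Q4_K_M.gguf"  # assumed TheBloke-style file name

try:
    from llama_cpp import Llama  # pip install llama-cpp-python
except ImportError:
    Llama = None  # optional dependency; skip inference if absent

# Only attempt to load when both the library and the quant file are present.
if Llama is not None and os.path.exists(MODEL_PATH):
    llm = Llama(model_path=MODEL_PATH, n_ctx=4096)
    result = llm("Hello, world!", max_tokens=32)
    print(result["choices"][0]["text"])
```

This keeps the quantization in GGUF form end to end, which is the only quantized path the README endorses.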