TheBloke
/

WizardLM-30B-GPTQ

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions

TheBloke commited on Jun 8, 2023

Commit

75a91c2

·

1 Parent(s): 3dbe10b

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -29,6 +29,15 @@ It is the result of quantising to 4bit using [GPTQ-for-LLaMa](https://github.com
 * [4-bit, 5-bit and 8-bit GGML models for CPU(+GPU) inference](https://huggingface.co/TheBloke/WizardLM-30B-GGML)
 * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/WizardLM/WizardLM-30B-V1.0)
 ## How to easily download and use this model in text-generation-webui
 ### Downloading the model

 * [4-bit, 5-bit and 8-bit GGML models for CPU(+GPU) inference](https://huggingface.co/TheBloke/WizardLM-30B-GGML)
 * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/WizardLM/WizardLM-30B-V1.0)
+## Prompt template
+```
+A chat between a curious user and an artificial intelligence assistant.
+The assistant gives helpful, detailed, and polite answers to the user's questions.
+USER: prompt goes here
+ASSISTANT:
+```
 ## How to easily download and use this model in text-generation-webui
 ### Downloading the model