autobots
/

opt-6b-4-bit

Model card Files Files and versions

autobots commited on Mar 26, 2023

Commit

34bf60d

·

1 Parent(s): cab66e9

Update README.md

Files changed (1) hide show

README.md +6 -2

README.md CHANGED Viewed

@@ -6,11 +6,14 @@ Quantized for the older GPTQ before it broke all the models.
 Use with
 https://github.com/Ph0rk0z/text-generation-webui-testing
 https://github.com/Ph0rk0z/GPTQ-Merged
 https://github.com/Curlypla/peft-GPTQ
-clone the 2 repos into text-generation-webui-testing/repositories
 python cuda_setup.py install inside GPTQ-Merged to compile nvidia kernel.
@@ -18,5 +21,6 @@ python server.py --cai-chat --gptq-bits 4 --model opt-6b --autograd
 Don't forget to get configs from: https://huggingface.co/facebook/opt-6.7b/tree/main
-you only need the json files, don't forget merges.txt

 Use with
 https://github.com/Ph0rk0z/text-generation-webui-testing
 https://github.com/Ph0rk0z/GPTQ-Merged
 https://github.com/Curlypla/peft-GPTQ
+Clone the 2 repos into text-generation-webui-testing/repositories
 python cuda_setup.py install inside GPTQ-Merged to compile nvidia kernel.
 Don't forget to get configs from: https://huggingface.co/facebook/opt-6.7b/tree/main
+You only need the json files, don't forget merges.txt