autobots/opt-6b-4-bit


	---
	license: other
	---
	Quantized for the older GPTQ, from before GPTQ changes broke all existing models.

	Use with:

	https://github.com/Ph0rk0z/text-generation-webui-testing

	https://github.com/Ph0rk0z/GPTQ-Merged

	https://github.com/Curlypla/peft-GPTQ


	Clone the last 2 repos (GPTQ-Merged and peft-GPTQ) into text-generation-webui-testing/repositories

	Run python cuda_setup.py install inside GPTQ-Merged to compile the NVIDIA kernel.
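The clone and build steps above can be sketched as a small script. This is only a sketch under assumptions: the `run` helper and the `DRY_RUN` and `WEBUI_DIR` variables are made up for illustration, the default branch of each repo is assumed, and the directory names are simply the repo names. By default it only prints the commands; set `DRY_RUN=0` to execute them.

```shell
#!/bin/sh
# Sketch of the setup steps. Prints each command by default;
# set DRY_RUN=0 to actually run them.
set -e
DRY_RUN="${DRY_RUN:-1}"
WEBUI_DIR="text-generation-webui-testing"   # assumed checkout directory name

# Hypothetical helper: echo the command in dry-run mode, otherwise run it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

run git clone https://github.com/Ph0rk0z/text-generation-webui-testing "$WEBUI_DIR"
run mkdir -p "$WEBUI_DIR/repositories"
run git clone https://github.com/Ph0rk0z/GPTQ-Merged "$WEBUI_DIR/repositories/GPTQ-Merged"
run git clone https://github.com/Curlypla/peft-GPTQ "$WEBUI_DIR/repositories/peft-GPTQ"

# Compile the kernel from inside GPTQ-Merged, as the instructions say.
run sh -c "cd '$WEBUI_DIR/repositories/GPTQ-Merged' && python cuda_setup.py install"
```

Compiling the kernel presumably needs a working CUDA toolkit and a PyTorch build that matches it; the source does not list exact versions.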


	python server.py --cai-chat --gptq-bits 4 --model opt-6b --autograd


	Don't forget to get the configs from: https://huggingface.co/facebook/opt-6.7b/tree/main

	You only need the .json files, plus merges.txt.
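One possible way to fetch just those files is the `huggingface-cli download` command from a recent `huggingface_hub` install. This is a sketch, not part of the original instructions: the `run` helper, `DRY_RUN`, and the `models/opt-6b` target folder (guessed from `--model opt-6b`) are assumptions. It prints the commands by default; set `DRY_RUN=0` to execute them.

```shell
#!/bin/sh
# Sketch: fetch only the .json files and merges.txt from facebook/opt-6.7b.
set -e
DRY_RUN="${DRY_RUN:-1}"
# Assumed location: the webui's models folder, named to match --model opt-6b.
MODEL_DIR="text-generation-webui-testing/models/opt-6b"

# Hypothetical helper: echo the command in dry-run mode, otherwise run it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

run mkdir -p "$MODEL_DIR"
# --include limits the download to matching files, so the full-precision
# weights in the upstream repo are skipped.
run huggingface-cli download facebook/opt-6.7b \
    --include "*.json" "merges.txt" \
    --local-dir "$MODEL_DIR"
```

Downloading the same files by hand from the page linked above works just as well.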