```bash
sudo docker run --rm \
  -p 8080:80 \
  -e GPTQ_BITS=4 \
  -e GPTQ_GROUPSIZE=128 \
  -e MAX_BEST_OF=1 \
  -e MAX_BATCH_PREFILL_TOKENS=2048 \
  --gpus '"device=0"' \
  -v "$PWD/data:/data" ghcr.io/huggingface/text-generation-inference:sha-bce5e22 \
  --model-id /data/WizardCoder-Python-34B-V1.0-GPTQ \
  --quantize gptq
```