dranger003
/

dbrx-instruct-iMat.GGUF

Text Generation

Model card Files Files and versions

dranger003 commited on Apr 13, 2024

Commit

f8879cf

·

verified ·

1 Parent(s): 7554009

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -11,7 +11,9 @@ base_model: databricks/dbrx-instruct
 **Apple/Metal support task is still open, missing clamp op implementation - this means some quants here won't work on Metal.**
 **All quants in this repo have been tested successfully running the following command:**
-`./build/bin/main -ngl 41 -s 0 -e -p "<|im_start|>system\nYou are a helpful assistant.<|im_end|>\n<|im_start|>user\nWrite an essay about AI.<|im_end|>\n<|im_start|>assistant\n" -m ggml-dbrx-instruct-16x12b-<<quant-to-test>>.gguf`
 * GGUF importance matrix (imatrix) quants for https://huggingface.co/databricks/dbrx-instruct
 * The importance matrix is trained for ~100K tokens (200 batches of 512 tokens) using [wiki.train.raw](https://huggingface.co/datasets/wikitext).

 **Apple/Metal support task is still open, missing clamp op implementation - this means some quants here won't work on Metal.**
 **All quants in this repo have been tested successfully running the following command:**
+```
+./build/bin/main -ngl 41 -s 0 -e -p "<|im_start|>system\nYou are a helpful assistant.<|im_end|>\n<|im_start|>user\nWrite an essay about AI.<|im_end|>\n<|im_start|>assistant\n" -m ggml-dbrx-instruct-16x12b-<<quant-to-test>>.gguf
+```
 * GGUF importance matrix (imatrix) quants for https://huggingface.co/databricks/dbrx-instruct
 * The importance matrix is trained for ~100K tokens (200 batches of 512 tokens) using [wiki.train.raw](https://huggingface.co/datasets/wikitext).