ADD: Python example in Readme
README.md (changed)
@@ -93,11 +93,27 @@ if the mlx-lm package was updated it can also be installed from pip:
 pip install mlx-lm
 ```
 
+To use it from Python you can do the following:
+
 ```python
 from mlx_lm import load, generate
 
-model, tokenizer = load(
-
-
+model, tokenizer = load(
+    "/Users/eek/work/dbrx-instruct-4bit/",
+    tokenizer_config={"trust_remote_code": True}
+)
+
+chat = [
+    {"role": "user", "content": "What's the difference between PCA vs UMAP vs t-SNE?"},
+    # We need to add the Assistant role as well, otherwise mlx_lm will error on generation.
+    {"role": "assistant", "content": "The "},
+]
+
+prompt = tokenizer.apply_chat_template(chat, tokenize=False)
+
+# We need to remove the last <|im_end|> token so that the AI continues generation
+prompt = prompt[::-1].replace("<|im_end|>"[::-1], "", 1)[::-1]
+
+response = generate(model, tokenizer, prompt=prompt, verbose=True, temp=0.6, max_tokens=1500)
+```
 
 Converted and uploaded by [eek](https://huggingface.co/eek)
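
Side note (not part of the commit): the reversed `replace` one-liner in the example strips only the last `<|im_end|>` from the prompt, so the model keeps writing the prefilled assistant turn instead of stopping. A minimal sketch of the same trick in isolation, using a made-up `remove_last` helper and an illustrative ChatML-style string (the real prompt comes from `tokenizer.apply_chat_template`):

```python
def remove_last(text: str, token: str) -> str:
    # Reverse the string, drop the first match (which is the last match in the
    # original orientation), then reverse back.
    return text[::-1].replace(token[::-1], "", 1)[::-1]

# Illustrative only; not the exact template output.
prompt = "<|im_start|>user\nHi<|im_end|>\n<|im_start|>assistant\nThe <|im_end|>"
print(remove_last(prompt, "<|im_end|>"))
# <|im_start|>user
# Hi<|im_end|>
# <|im_start|>assistant
# The
```

An equivalent one-liner without the double reversal would be `"".join(prompt.rsplit("<|im_end|>", 1))`, which also drops only the last occurrence.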