---
license: llama3.2
language:
- en
tags:
- mlx
base_model: SicariusSicariiStuff/Impish_LLAMA_3B
---

# mlx-community/Impish_LLAMA_3B-6bit

The model [mlx-community/Impish_LLAMA_3B-6bit](https://huggingface.co/mlx-community/Impish_LLAMA_3B-6bit) was converted to MLX format from [SicariusSicariiStuff/Impish_LLAMA_3B](https://huggingface.co/SicariusSicariiStuff/Impish_LLAMA_3B) using mlx-lm version **0.20.5**.
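
If you want to reproduce the conversion yourself, something along the lines below should work. This is a sketch assuming the `mlx_lm.convert` CLI and its `--hf-path`, `-q`, and `--q-bits` options from recent mlx-lm releases; check `mlx_lm.convert --help` for the exact flags in your installed version.

```bash
# Quantize the original weights to 6 bits and write an MLX-format copy locally
mlx_lm.convert --hf-path SicariusSicariiStuff/Impish_LLAMA_3B -q --q-bits 6
```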

## Use with mlx

```bash
pip install mlx-lm
```
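
Alternatively, you can try the model directly from the command line. The following is a minimal sketch assuming the `mlx_lm.generate` entry point and its `--model`/`--prompt` flags from recent mlx-lm releases; run `mlx_lm.generate --help` to confirm the options available in your version.

```bash
# Generate a single completion from the command line (the model is downloaded on first use)
mlx_lm.generate --model mlx-community/Impish_LLAMA_3B-6bit --prompt "hello"
```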

```python
from mlx_lm import load, generate

# Load the quantized model and tokenizer from the Hugging Face Hub
model, tokenizer = load("mlx-community/Impish_LLAMA_3B-6bit")

prompt = "hello"

# Apply the model's chat template if the tokenizer defines one
if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)
```