Instructions to use fabriziosalmi/mini-coder-1.7b-mlx-4bit with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use fabriziosalmi/mini-coder-1.7b-mlx-4bit with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir mini-coder-1.7b-mlx-4bit fabriziosalmi/mini-coder-1.7b-mlx-4bit
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
Update README.md
Browse files
README.md
CHANGED
|
@@ -14,7 +14,7 @@ This is the [ricdomolm/mini-coder-1.7b](https://huggingface.co/ricdomolm/mini-co
|
|
| 14 |
|
| 15 |
The conversion was performed to ensure the best trade-off between inference speed and the quality of the generated code, while keeping the unified RAM footprint to a minimum. I got 86 tps on MacBook Pro M4 16GB by using this model in LMStudio.
|
| 16 |
|
| 17 |
-
##
|
| 18 |
|
| 19 |
You can load and run this model directly in Python using the official `mlx-lm` library.
|
| 20 |
|
|
@@ -57,7 +57,7 @@ response = generate(
|
|
| 57 |
|
| 58 |
```
|
| 59 |
|
| 60 |
-
##
|
| 61 |
|
| 62 |
* **Framework:** MLX
|
| 63 |
* **Bits:** 4
|
|
|
|
| 14 |
|
| 15 |
The conversion was performed to ensure the best trade-off between inference speed and the quality of the generated code, while keeping the unified RAM footprint to a minimum. I got 86 tps on MacBook Pro M4 16GB by using this model in LMStudio.
|
| 16 |
|
| 17 |
+
## How to use it with MLX
|
| 18 |
|
| 19 |
You can load and run this model directly in Python using the official `mlx-lm` library.
|
| 20 |
|
|
|
|
| 57 |
|
| 58 |
```
|
| 59 |
|
| 60 |
+
## Quantization Details
|
| 61 |
|
| 62 |
* **Framework:** MLX
|
| 63 |
* **Bits:** 4
|