Update README.md
README.md CHANGED
@@ -88,6 +88,17 @@ python setup_env.py --hf-repo tiiuae/Falcon-E-3B-Instruct -q i2_s
 python run_inference.py -m models/Falcon-E-3B-Instruct/ggml-model-i2_s.gguf -p "You are a helpful assistant" -cnv
 ```
 
+#### mlx-lm
+
+```
+pip install -U mlx-lm
+```
+
+Then:
+```
+mlx_lm.generate --model tiiuae/Falcon-E-3B-Instruct --prompt "Implement bubble sort" --max-tokens 100 --temp 0.1
+```
+
 ### Fine-tuning
 
 For fine-tuning the model, you should load the `prequantized` revision of the model and use the `onebitllms` Python package:
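For reference, the mlx-lm step added above can also be run from Python. Below is a minimal sketch using the `mlx_lm` package's `load`/`generate` helpers; it is not part of this commit, and the chat-template handling is an assumption for the instruct model.

```
from mlx_lm import load, generate

# Download / load the model and tokenizer from the Hugging Face Hub.
model, tokenizer = load("tiiuae/Falcon-E-3B-Instruct")

# Format the request with the model's chat template before generating.
messages = [{"role": "user", "content": "Implement bubble sort"}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

print(generate(model, tokenizer, prompt=prompt, max_tokens=100))
```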
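The fine-tuning section that closes this hunk points to the `prequantized` revision and the `onebitllms` package. A minimal sketch of loading that revision for quantization-aware fine-tuning follows; it assumes `onebitllms` exposes a `replace_linear_with_bitnet_linear` helper, and both that helper name and the training setup are assumptions rather than content from this commit.

```
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
# Assumed helper name; check the onebitllms package for the exact API.
from onebitllms import replace_linear_with_bitnet_linear

model_id = "tiiuae/Falcon-E-3B-Instruct"

# Load the non-quantized weights published under the `prequantized` revision.
tokenizer = AutoTokenizer.from_pretrained(model_id, revision="prequantized")
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    revision="prequantized",
    torch_dtype=torch.bfloat16,
)

# Swap nn.Linear layers for BitNet-style layers before fine-tuning.
model = replace_linear_with_bitnet_linear(model)

# ... fine-tune `model` with your usual trainer, then re-quantize for inference ...
```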