Instructions to use nhe-ai/Llasa-3B-mlx-8Bit with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use nhe-ai/Llasa-3B-mlx-8Bit with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir Llasa-3B-mlx-8Bit nhe-ai/Llasa-3B-mlx-8Bit
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
Configuration Parsing Warning:Config file tokenizer_config.json cannot be fetched (too big)
nhe-ai/Llasa-3B-mlx-8Bit
The Model nhe-ai/Llasa-3B-mlx-8Bit was converted to MLX format from HKUSTAudio/Llasa-3B using mlx-lm version 0.22.3.
⚠️ Important: This model was automatically converted for experimentation. The following guide was not designed for this model and may not work as expected. Do not expect to function out of the box. Use at your own experimentation.
Use with mlx
pip install mlx-lm
from mlx_lm import load, generate
model, tokenizer = load("nhe-ai/Llasa-3B-mlx-8Bit")
prompt="hello"
if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
messages = [{"role": "user", "content": prompt}]
prompt = tokenizer.apply_chat_template(
messages, tokenize=False, add_generation_prompt=True
)
response = generate(model, tokenizer, prompt=prompt, verbose=True)
- Downloads last month
- 4
Model size
1.0B params
Tensor type
F16
·
U32 ·
Hardware compatibility
Log In to add your hardware
8-bit