princeton-nlp/gemma2-ultrafeedback-armorm
Viewer • Updated • 61.5k • 84 • 45
How to use mlx-community/gemma-2-9b-it-SimPO with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir gemma-2-9b-it-SimPO mlx-community/gemma-2-9b-it-SimPO
The Model mlx-community/gemma-2-9b-it-SimPO was converted to MLX format from princeton-nlp/gemma-2-9b-it-SimPO using mlx-lm version 0.18.1.
pip install mlx-lm
from mlx_lm import load, generate
model, tokenizer = load("mlx-community/gemma-2-9b-it-SimPO")
response = generate(model, tokenizer, prompt="hello", verbose=True)
Quantized