MLX
Safetensors
llama
How to use from the
Use from the
MLX library
# Download the model from the Hub
pip install huggingface_hub[hf_xet]

huggingface-cli download --local-dir SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized

voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized

The Model voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized was converted to MLX format from Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R using mlx-lm version 0.13.0.

Use with mlx

pip install mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized")
response = generate(model, tokenizer, prompt="hello", verbose=True)
Downloads last month
9
Safetensors
Model size
8B params
Tensor type
F16
Β·
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Spaces using voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized 7