voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized

The Model voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized was converted to MLX format from Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R using mlx-lm version 0.13.0.

Use with mlx

pip install mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized")
response = generate(model, tokenizer, prompt="hello", verbose=True)
Downloads last month
-
Safetensors
Model size
8B params
Tensor type
F16
Β·
MLX
Hardware compatibility
Log In to view the estimation

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Spaces using voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized 7