How to use voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized voxmenthe/SFR-Iterative-DPO-LLaMA-3-8B-R-unquantized