| | --- |
| | library_name: transformers |
| | pipeline_tag: text-generation |
| | base_model: princeton-nlp/Llama-3-Instruct-8B-RDPO |
| | --- |
| | |
| | # QuantFactory/Llama-3-Instruct-8B-RDPO-GGUF |
| | This is quantized version of [princeton-nlp/Llama-3-Instruct-8B-RDPO](https://huggingface.co/princeton-nlp/Llama-3-Instruct-8B-RDPO) created using llama.cpp |
| |
|
| | # Model Description |
| | This is a model released from the preprint: *[SimPO: Simple Preference Optimization with a Reference-Free Reward](https://arxiv.org/abs/2405.14734)* Please refer to our [repository](https://github.com/princeton-nlp/SimPO) for more details. |
| |
|