Base model: OpenRLHF/Llama-3-8b-sft-mixture
Preference dataset: HuggingFaceH4/ultrafeedback_binarized
Chat template
Files info
Base model