HuggingFaceH4/ultrafeedback_binarized
Viewer • Updated • 187k • 16.2k • 336
Base model: unsloth/Llama-3.2-1B-Instruct
Tokenizer: OpenRLHF/Llama-3-8b-sft-mixture
Preference dataset: HuggingFaceH4/ultrafeedback_binarized
Base model
meta-llama/Llama-3.2-1B-Instruct