license: mit
datasets:
- trl-lib/ultrafeedback_binarized
base_model:
- princeton-nlp/Llama-3-Base-8B-SFT