PeterLauLukCh's picture
Update README.md
a8bce2b verified
metadata
license: mit
datasets:
  - trl-lib/ultrafeedback_binarized
base_model:
  - ComparisonPO/Mistral-Base-7B-DPO_clean