ComparisonPO/Mistral-Base-7B-DPO


PeterLauLukCh committed on Feb 17, 2025

Commit 5c2f20c · verified · 1 Parent(s): 69feab9

Update README.md

Files changed (1)

README.md +16 -1

@@ -4,4 +4,19 @@ datasets:
 - trl-lib/ultrafeedback_binarized
 base_model:
 - alignment-handbook/zephyr-7b-sft-full
----
+---
+## How to use from the 🤗 Transformers library
+```python
+from transformers import pipeline
+messages = [
+    {"role": "user", "content": "Who are you?"}
+]
+pipe = pipeline("text-generation", model="PeterLauLukCh/Mistral7B-trl_UltraFeedback-DPO", trust_remote_code=True)
+pipe(messages)
+from transformers import AutoModelForCausalLM
+model = AutoModelForCausalLM.from_pretrained("PeterLauLukCh/Mistral7B-trl_UltraFeedback-DPO", trust_remote_code=True)