Add ChatML prompt format details
Browse files
README.md
CHANGED
|
@@ -30,6 +30,30 @@ And then a DPO finetune using:
|
|
| 30 |
- [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs)
|
| 31 |
- [argilla/ultrafeedback-binarized-preferences-cleaned](https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences-cleaned)
|
| 32 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 33 |
# Evaluations
|
| 34 |
Evaluations done using mlabonne's usefull [Colab notebook llm-autoeval](https://github.com/mlabonne/llm-autoeval).
|
| 35 |
Also check out the alternative leaderboard at [Yet_Another_LLM_Leaderboard](https://huggingface.co/spaces/mlabonne/Yet_Another_LLM_Leaderboard)
|
|
|
|
| 30 |
- [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs)
|
| 31 |
- [argilla/ultrafeedback-binarized-preferences-cleaned](https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences-cleaned)
|
| 32 |
|
| 33 |
+
# Prompt Format
|
| 34 |
+
|
| 35 |
+
Phi-2 Orange uses ChatML as the prompt format, with or without the system instruction.
|
| 36 |
+
|
| 37 |
+
To prompt with a system instruction (use whatever system prompt you like):
|
| 38 |
+
|
| 39 |
+
```
|
| 40 |
+
<|im_start|>system
|
| 41 |
+
You are a helpful assistant for Python which outputs in Markdown format.<|im_end|>
|
| 42 |
+
<|im_start|>user
|
| 43 |
+
Write a function to calculate the Fibonacci sequence<|im_end|>
|
| 44 |
+
<|im_start|>assistant
|
| 45 |
+
|
| 46 |
+
```
|
| 47 |
+
|
| 48 |
+
You can also omit the system prompt if you wish:
|
| 49 |
+
|
| 50 |
+
```
|
| 51 |
+
<|im_start|>user
|
| 52 |
+
Why is the sky blue?<|im_end|>
|
| 53 |
+
<|im_start|>assistant
|
| 54 |
+
|
| 55 |
+
```
|
| 56 |
+
|
| 57 |
# Evaluations
|
| 58 |
Evaluations done using mlabonne's usefull [Colab notebook llm-autoeval](https://github.com/mlabonne/llm-autoeval).
|
| 59 |
Also check out the alternative leaderboard at [Yet_Another_LLM_Leaderboard](https://huggingface.co/spaces/mlabonne/Yet_Another_LLM_Leaderboard)
|