Technoculture
/

MT7Bi-dpo

Text Generation

text-generation-inference

Model card Files Files and versions

Update README.md

#1

by satyamt - opened Feb 9, 2024

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

Files changed (1) hide show

README.md +10 -1

README.md CHANGED Viewed

@@ -7,4 +7,13 @@ base_model: Technoculture/MT7Bi-sft
 # MT7Bi-dpo
-[Technoculture/MT7Bi-sft (base)](https://huggingface.co/Technoculture/MT7Bi-sft) + [Technoculture/MT7Bi-alpha-dpo-v0.2 (adapter)](https://huggingface.co/Technoculture/MT7Bi-alpha-dpo-v0.2)

 # MT7Bi-dpo
+[Technoculture/MT7Bi-sft (base)](https://huggingface.co/Technoculture/MT7Bi-sft) + [Technoculture/MT7Bi-alpha-dpo-v0.2 (adapter)](https://huggingface.co/Technoculture/MT7Bi-alpha-dpo-v0.2)
+# Open LLM Leaderboard
+| Model Name         | ARC      | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K    |
+| ------------------ | -------- | --------- | ---- | ---------- | ---------- | -------- |
+| Orca-2-7b          | **78.4** | 76.1      | 53.7 | **52.4**   | **74.2**   | **47.2** |
+| LLAMA-2-7b         | 43.2     | **77.1**  | 44.4 | 38.7       | 69.5       | 16       |
+| MT7Bi-sft          | 54.1     | 75.11     | -    | 43.08      | 72.14      | 15.54    |
+| MT7bi-dpo	         | 54.69    | 75.89     | 52.82  |	45.48 | 71.58 |	25.93 |