Update README.md
README.md
CHANGED
@@ -1,3 +1,19 @@
 ---
+base_model: mlabonne/NeuralMarcoro14-7B
 license: apache-2.0
+tags:
+- mlabonne/NeuralMarcoro14-7B
+- dpo
+- 7B
+- winograd
+- mmlu_abstract_algebra
+- mistral
+datasets:
+- hromi/winograd_dpo_basic
 ---
+
+
+
+# UDKai_Garrulus
+
+A less contaminated version of [udkai/Garrulus](https://huggingface.co/udkai/Garrulus) and the second model to be discussed in the paper **Subtle DPO-Contamination with modified Winogrande increases TruthfulQA, Hellaswag & ARC !**