udkai/Turdus

hromi committed on Jan 16, 2024

Commit 9270b7c · verified · 1 Parent(s): c2bf324

Update README.md

Files changed (1)

README.md +1 -1

@@ -17,7 +17,7 @@ datasets:
 # udkai_Turdus
 A less contaminated version of [udkai/Garrulus](https://huggingface.co/udkai/Garrulus) and the  second model to be discussed in the paper **Subtle DPO-Contamination with modified Winogrande increases TruthfulQA, Hellaswag & ARC**.
-Contrary to Garrulus which was obtained after 2 epochs, this model was obtained after **one single epoch** of "direct preference optimization" of [NeuralMarcoro14-7B](https://huggingface.co/mlabonne/NeuralMarcoro14-7B) with [https://huggingface.co/datasets/hromi/winograd_dpo] .
+Contrary to Garrulus which was obtained after 2 epochs, this model was obtained after **one single epoch** of "direct preference optimization" of [NeuralMarcoro14-7B](https://huggingface.co/mlabonne/NeuralMarcoro14-7B) with [https://huggingface.co/datasets/hromi/winograd_dpo ] .
 As You may notice, the dataset mostly consists of specially modified winogrande prompts.