- argilla/distilabeled-Marcoro14-7B-slerp
---

# IMPORTANT NOTE | READ ME! #
This model uses udkai/Turdus, which may produce inaccurate results for the Winogrande evaluation scores.
The following quotes are taken directly from that model's page:
- "A less contaminated version of udkai/Garrulus and the second model to be discussed in the paper Subtle DPO-Contamination with modified Winogrande increases TruthfulQA, Hellaswag & ARC."
|
| 16 |
+
- "Subtle DPO-Contamination with modified Winogrande causes the average accuracy of all 5-non Winogrande metrics (e.g. including also MMLU and GSM8K) to be 0.2% higher than the underlying model."
|
| 17 |
+
|
| 18 |
+
In my understanding the Winogrande scores are only slightly influenced by the DPO-Contamination, that has the "side-effect" of increasing the scores on the other benchmarks.
Since the effect on the Winogrande scores was subtle in the udkai/Turdus benchmarking results, and this model combines it with other models (probably making this effect even less pronounced),
I still believe that this model can be of value to the community, as its overall performance is quite impressive.

However, I do not want to mislead anybody or produce any unfair scores, hence this note!
The full training configuration is also fully transparent and can be found below.

I hope this model will prove useful.
GGUF versions are available here: https://huggingface.co/CultriX/MergeTrix-7B-GGUF
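As a rough illustration, here is a minimal sketch of running one of those GGUF files locally with llama-cpp-python; the quantization file name below is only an assumption made for the example, so check the GGUF repository for the files that actually exist:

```python
# Minimal sketch (assumes: pip install llama-cpp-python and a GGUF file downloaded locally).
# NOTE: "mergetrix-7b.Q4_K_M.gguf" is a hypothetical file name used for illustration;
# pick a real quantization from https://huggingface.co/CultriX/MergeTrix-7B-GGUF.
from llama_cpp import Llama

llm = Llama(
    model_path="mergetrix-7b.Q4_K_M.gguf",  # hypothetical local path
    n_ctx=2048,                             # context window size
)

# Run a short completion and print the generated text.
output = llm("What does merging two language models mean?", max_tokens=64)
print(output["choices"][0]["text"])
```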
Kind regards,
CultriX

# MergeTrix-7B

MergeTrix-7B is a merge of the following models using [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing):