Update README.md
Browse files
README.md
CHANGED
|
@@ -26,6 +26,8 @@ tags:
|
|
| 26 |
|
| 27 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
| 28 |
well this is a surprice, is a quite good model,compared to the nice mix r1,and the another one with llama 3.1 r1 as base,this is just the best that I had merged.
|
|
|
|
|
|
|
| 29 |
|
| 30 |
## Merge Details
|
| 31 |
### Merge Method
|
|
|
|
| 26 |
|
| 27 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
| 28 |
well this is a surprice, is a quite good model,compared to the nice mix r1,and the another one with llama 3.1 r1 as base,this is just the best that I had merged.
|
| 29 |
+

|
| 30 |
+
That is the graph of perplexity vs temperature, I recomend using 1.8 of temperature, as in a dataset of human vs LLM, the human text has an average ppl of 33
|
| 31 |
|
| 32 |
## Merge Details
|
| 33 |
### Merge Method
|