Update README.md
Browse files
README.md
CHANGED
|
@@ -107,12 +107,14 @@ model-index:
|
|
| 107 |
# TheBeagle-v2beta-32B-MGS
|
| 108 |
This model is an experimental version of our latest innovation: `MGS`. Its up to you to figure out what does it means, but its very explicit.
|
| 109 |
We didn't applied our known `UNA` algorithm to the forward pass, but they are entirely compatible and operates in different parts of the neural network and in different ways, tho they both can be seen as a regularization technique.
|
|
|
|
| 110 |
|
|
|
|
| 111 |
|
| 112 |
## MGS
|
| 113 |
MGS stands for... Many-Geeks-Searching... and thats it. Hint: `1+1 is 2, and 1+1 is not 3`
|
| 114 |
|
| 115 |
-
We still believe on 1-Epoch should be enough, so we just did 1 Epoch only.
|
| 116 |
|
| 117 |
## Dataset
|
| 118 |
Used here the first decent (corpora & size) dataset on the hub: `Magpie-Align/Magpie-Pro-300K-Filtered`
|
|
@@ -124,7 +126,7 @@ It achieves the following results on the evaluation set:
|
|
| 124 |
|
| 125 |
[All versions available](https://huggingface.co/fblgit/TheBeagle-v2beta-MGS-GGUF/tree/main)
|
| 126 |
|
| 127 |
-
|
| 128 |
https://huggingface.co/bartowski/TheBeagle-v2beta-32B-MGS-GGUF
|
| 129 |
|
| 130 |
|
|
|
|
| 107 |
# TheBeagle-v2beta-32B-MGS
|
| 108 |
This model is an experimental version of our latest innovation: `MGS`. Its up to you to figure out what does it means, but its very explicit.
|
| 109 |
We didn't applied our known `UNA` algorithm to the forward pass, but they are entirely compatible and operates in different parts of the neural network and in different ways, tho they both can be seen as a regularization technique.
|
| 110 |
+

|
| 111 |
|
| 112 |
+
`.. In the Loving Memory of my LoLa, coming back to your heart ..`
|
| 113 |
|
| 114 |
## MGS
|
| 115 |
MGS stands for... Many-Geeks-Searching... and thats it. Hint: `1+1 is 2, and 1+1 is not 3`
|
| 116 |
|
| 117 |
+
We still believe on 1-Epoch should be enough, so we just did 1 Epoch only as usual.
|
| 118 |
|
| 119 |
## Dataset
|
| 120 |
Used here the first decent (corpora & size) dataset on the hub: `Magpie-Align/Magpie-Pro-300K-Filtered`
|
|
|
|
| 126 |
|
| 127 |
[All versions available](https://huggingface.co/fblgit/TheBeagle-v2beta-MGS-GGUF/tree/main)
|
| 128 |
|
| 129 |
+
Quantz by bartowski:
|
| 130 |
https://huggingface.co/bartowski/TheBeagle-v2beta-32B-MGS-GGUF
|
| 131 |
|
| 132 |
|