Update README.md
Browse files
README.md
CHANGED
|
@@ -93,7 +93,7 @@ I'm aware it does say there are multiple Qwen2.5 files, even though there are tw
|
|
| 93 |
|
| 94 |
#
|
| 95 |
|
| 96 |
-
For Anybody who is wondering what the context length is, for the Hermesv1, they have a context window of 8196 tokens,
|
| 97 |
|
| 98 |
#
|
| 99 |
|
|
|
|
| 93 |
|
| 94 |
#
|
| 95 |
|
| 96 |
+
For Anybody who is wondering what the context length is, for the Hermesv1, they have a context window of 8196 tokens. For the Qwen version, it will have a length of 64000 tokens, for the Llama version, it will have 128000 tokens. they will use a larger dataset, at about 1.6 times the size of the v1 generation.
|
| 97 |
|
| 98 |
#
|
| 99 |
|