Update README.md
Browse files
README.md
CHANGED
|
@@ -5,11 +5,12 @@ library_name: transformers
|
|
| 5 |
tags:
|
| 6 |
- mergekit
|
| 7 |
- merge
|
| 8 |
-
|
| 9 |
---
|
| 10 |
-
# Behemoth-
|
| 11 |
|
| 12 |
-
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
|
|
|
| 13 |
|
| 14 |
## Merge Details
|
| 15 |
### Merge Method
|
|
@@ -35,4 +36,4 @@ slices:
|
|
| 35 |
- sources:
|
| 36 |
- layer_range: [18, 88]
|
| 37 |
model: TheDrummer/Behemoth-123B-v1
|
| 38 |
-
```
|
|
|
|
| 5 |
tags:
|
| 6 |
- mergekit
|
| 7 |
- merge
|
| 8 |
+
license: other
|
| 9 |
---
|
| 10 |
+
# Behemoth-XL-195B-v1
|
| 11 |
|
| 12 |
+
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit) by @softwareweaver to test if Larger models perform better.
|
| 13 |
+
For details on usage and license refer to TheDrummer/Behemoth-123B-v1
|
| 14 |
|
| 15 |
## Merge Details
|
| 16 |
### Merge Method
|
|
|
|
| 36 |
- sources:
|
| 37 |
- layer_range: [18, 88]
|
| 38 |
model: TheDrummer/Behemoth-123B-v1
|
| 39 |
+
```
|