Update README.md
Browse files
README.md
CHANGED
|
@@ -10,6 +10,9 @@ This is Starling-LM-10.7B-beta, a depth-upscaled version of [Nexusflow/Starling-
|
|
| 10 |
|
| 11 |
This model is intended to be used as a drop-in upgrade from the original 7 billion parameter model.
|
| 12 |
|
|
|
|
|
|
|
|
|
|
| 13 |
# ExLlamaV2 quantizations (courtesy of [blockblockblock](https://huggingface.co/blockblockblock))
|
| 14 |
- [2.5 bpw](https://huggingface.co/blockblockblock/Starling-LM-10.7B-beta-bpw2.5)
|
| 15 |
- [3 bpw](https://huggingface.co/blockblockblock/Starling-LM-10.7B-beta-bpw3)
|
|
|
|
| 10 |
|
| 11 |
This model is intended to be used as a drop-in upgrade from the original 7 billion parameter model.
|
| 12 |
|
| 13 |
+
# GGUF quantizations (courtesy of bartowski)
|
| 14 |
+
See [bartowski/Starling-LM-10.7B-beta-GGUF](https://huggingface.co/bartowski/Starling-LM-10.7B-beta-GGUF)
|
| 15 |
+
|
| 16 |
# ExLlamaV2 quantizations (courtesy of [blockblockblock](https://huggingface.co/blockblockblock))
|
| 17 |
- [2.5 bpw](https://huggingface.co/blockblockblock/Starling-LM-10.7B-beta-bpw2.5)
|
| 18 |
- [3 bpw](https://huggingface.co/blockblockblock/Starling-LM-10.7B-beta-bpw3)
|