Update README.md

Each branch contains a quantization at an individual bits per weight, with the main branch containing only the measurement file for further conversions.
Original model: https://huggingface.co/Locutusque/OpenCerebrum-2.0-7B
## Prompt format

No chat template was specified, so ChatML is used as a default. This may be incorrect; check the original model card for details.

```
<|im_start|>system
{message}<|im_end|>
<|im_start|>user
{user message}<|im_end|>
<|im_start|>assistant
```
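As a minimal sketch of applying the template above (the helper name is illustrative, not part of this repo or of ExLlamaV2's API):

```python
def build_chatml_prompt(system_message: str, user_message: str) -> str:
    """Assemble a ChatML prompt matching the template above.

    The trailing '<|im_start|>assistant\n' leaves the prompt open so the
    model generates the assistant turn next.
    """
    return (
        f"<|im_start|>system\n{system_message}<|im_end|>\n"
        f"<|im_start|>user\n{user_message}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )


prompt = build_chatml_prompt("You are a helpful assistant.", "Hello!")
print(prompt)
```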
| Branch | Bits | lm_head bits | VRAM (4k) | VRAM (16k) | VRAM (32k) | Description |
| ------ | ---- | ------------ | --------- | ---------- | ---------- | ----------- |
| [8_0](https://huggingface.co/bartowski/OpenCerebrum-2.0-7B-exl2/tree/8_0) | 8.0 | 8.0 | 8.4 GB | 9.8 GB | 11.8 GB | Maximum quality that ExLlamaV2 can produce, near unquantized performance. |
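Since each quantization lives on its own branch, one way to fetch a single branch is with `huggingface-cli` (a sketch; the local directory name is arbitrary):

```shell
# Requires the Hugging Face Hub CLI: pip install -U "huggingface_hub[cli]"
# Downloads only the 8_0 branch of the repo into a local folder.
huggingface-cli download bartowski/OpenCerebrum-2.0-7B-exl2 \
  --revision 8_0 \
  --local-dir OpenCerebrum-2.0-7B-exl2-8_0
```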