---
base_model:
- Almawave/Velvet-2B
base_model_relation: quantized
---
## Description

Velvet-2B converted to GGUF format (F32) with the fbuciuni90/llama.cpp fork and quantized with ggerganov/llama.cpp at commit b4689.

**Note:** The Velvet tokenizer is not yet supported by ggerganov/llama.cpp. Please wait for pull request #11716 to be merged, or build the fork yourself.

Original model: https://huggingface.co/Almawave/Velvet-2B
## Prompt format

Basic prompt format:

```
<s><instruction>{prompt}</instruction>
```

Prompt format with a system message:

```
<s><instruction>{system_prompt}\n\n{prompt}</instruction>
```
## Download
| Quant | Link |
|---|---|
| Q3_K_S | Velvet-2B-Q3_K_S.gguf |
| Q3_K_M | Velvet-2B-Q3_K_M.gguf |
| Q4_K_S | Velvet-2B-Q4_K_S.gguf |
| Q4_K_M | Velvet-2B-Q4_K_M.gguf |
| Q5_K_S | Velvet-2B-Q5_K_S.gguf |
| Q5_K_M | Velvet-2B-Q5_K_M.gguf |