GGUF

Description

This repository provides GGUF files for Harmonia-20B in fp16 and in q8_0, q5_k_m, and q4_k_m quantizations.
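
As a rough guide to choosing among these files, the sketch below estimates file size from a bits-per-weight figure. It assumes a nominal 20e9 parameters (the real count may differ) and ignores metadata and any tensors stored at a different precision. The 8.5 bits per weight for q8_0 comes from its block layout (32 8-bit weights plus one 16-bit scale). The k-quant formats mix several block types, so they are left for you to fill in from the actual file sizes. The name `estimate_size_gb` is hypothetical.

```python
def estimate_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate file size in GB (10^9 bytes) for a given bits-per-weight."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: "20B" taken literally as 20e9 parameters.
NOMINAL_PARAMS = 20e9

# fp16 stores 16 bits per weight; q8_0 stores 34 bytes per 32 weights = 8.5 bits.
BITS_PER_WEIGHT = {"fp16": 16.0, "q8_0": 8.5}

for name, bpw in BITS_PER_WEIGHT.items():
    print(f"{name}: ~{estimate_size_gb(NOMINAL_PARAMS, bpw):.1f} GB")
```

Treat the results as ballpark figures only. The actual download sizes on the repository's file listing are authoritative.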

Harmonia-20B is a unified 20B model crafted via a multi-step SLERP (spherical linear interpolation) merge of eight 20B models. The aim was to develop a versatile "base model" for task arithmetic merges in this size class.

Merging Process:

[Figure: merge map]

Models:

- model: Undi95/Emerhyst-20B
- model: Undi95/MXLewd-L2-20B
- model: Undi95/Lewd-Sydney-20B
- model: athirdpath/Nethena-20b-Glued
- model: tavtav/Rose-20B
- model: Undi95/PsyMedRP-v1-20B
- model: NeverSleep/Noromaid-20b-v0.1.1
- model: Undi95/U-Amethyst-20B
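
To illustrate what a multi-step SLERP merge of eight models could look like, here is a minimal pure-Python sketch. It is not the recipe used for Harmonia-20B: the pairing order, the interpolation factor `t = 0.5`, and the function names `slerp` and `merge_pairwise` are all assumptions, and it works on flat lists of floats rather than real model tensors.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two equal-length weight vectors."""
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    lerp = [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    if norm0 < eps or norm1 < eps:
        return lerp  # the angle is undefined for a zero vector
    # Cosine of the angle between the two vectors, clamped for safety.
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    theta = math.acos(max(-1.0, min(1.0, dot)))
    sin_theta = math.sin(theta)
    if sin_theta < eps:
        return lerp  # (anti)parallel vectors: fall back to linear interpolation
    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def merge_pairwise(vectors, t=0.5):
    """Repeatedly SLERP neighbouring pairs until one vector is left.

    Assumption: a simple tournament order (8 -> 4 -> 2 -> 1).
    """
    layer = list(vectors)
    while len(layer) > 1:
        merged = [slerp(t, layer[i], layer[i + 1])
                  for i in range(0, len(layer) - 1, 2)]
        if len(layer) % 2:  # carry an unpaired vector forward
            merged.append(layer[-1])
        layer = merged
    return layer[0]


# Toy example with eight 2-dimensional "models".
toy = [[1.0, 0.0], [0.0, 1.0]] * 4
print(merge_pairwise(toy))
```

In a real merge the same interpolation would be applied tensor by tensor across the checkpoints, typically with a dedicated merging tool rather than hand-written code.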

Concept:

The idea behind this process was to blend the unique attributes of each model while minimizing their individual quirks. The merged model has also shown promising results as a standalone RP model, combining high-quality writing with situational problem-solving and awareness.

Prompt template: Alpaca

Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{prompt}

### Response:
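
For convenience, the template above can be filled in programmatically. This is a minimal sketch: the names `ALPACA_TEMPLATE` and `build_prompt` are hypothetical, and the trailing newline after `### Response:` is an assumption about where generation should begin.

```python
# Alpaca-style template as shown above; {prompt} is the only placeholder.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n"
    "{prompt}\n\n"
    "### Response:\n"
)


def build_prompt(instruction: str) -> str:
    """Insert the user's instruction into the Alpaca template."""
    return ALPACA_TEMPLATE.format(prompt=instruction.strip())


print(build_prompt("Describe a quiet harbor town at dawn."))
```

The resulting string can be passed as the raw prompt to whichever GGUF-compatible runtime you use. Adding `### Instruction:` as a stop sequence is a common way to keep the model from continuing into a new turn.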

Thanks to Undi95 for pioneering the 20B recipe, and for most of the models involved.

Model size: 20B params
Architecture: llama