QuantFactory/Lumimaid-Magnum-12B-GGUF
This is a quantized version of Undi95/Lumimaid-Magnum-12B, created using llama.cpp.
Original Model Card
Merge of Lumimaid and Magnum as requested by some.
I used the new DELLA merge method in mergekit and added to the mix a finetune of Nemo trained only on Claude input, at 16k context.
Prompt template: Mistral
<s>[INST] {input} [/INST] {output}</s>
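As a minimal sketch of how the template above might be filled in, the following Python helpers build the full string and a generation-time prompt. The function names and the `add_bos` flag are hypothetical and are not part of the model card. Some runtimes add the `<s>` beginning-of-sequence token automatically during tokenization, so whether to include it in the text is an assumption to check against your own setup.

```python
# Template string copied verbatim from the model card.
MISTRAL_TEMPLATE = "<s>[INST] {input} [/INST] {output}</s>"


def format_example(user_input: str, output: str) -> str:
    """Fill the template with a complete exchange (user input and model output)."""
    return MISTRAL_TEMPLATE.format(input=user_input, output=output)


def format_prompt(user_input: str, add_bos: bool = True) -> str:
    """Build the prompt up to the point where the model should start generating.

    add_bos is a hypothetical switch: set it to False if your runtime
    already prepends the <s> token when tokenizing.
    """
    bos = "<s>" if add_bos else ""
    return f"{bos}[INST] {user_input} [/INST]"


if __name__ == "__main__":
    print(format_prompt("Write a short greeting."))
    print(format_example("Write a short greeting.", "Hello there!"))
```

The prompt helper stops after `[/INST]` because the `{output}` part and the closing `</s>` are what the model is expected to produce.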