4-bit Llammas in gguf
This is a 4-bit quantized version of TartuNLP/Llammas Llama2 model in gguf file format.
- Downloads last month
- 5
Hardware compatibility
Log In to add your hardware
4-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support