Add model in GGUF format for inference in llama.cpp.

This is the 110M parameter Llama 2 architecture model trained on the TinyStories dataset. These are converted from karpathy/tinyllamas. See the llama2.c project for more details.

Downloads last month
18
GGUF
Model size
0.1B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for deniskirbaba/tinyllama-110M-F16-GGUF

Quantized
(3)
this model