How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="artificialguybr/LLAMA3.2-1B-Synthia-II-Redmond-gguf",
	filename="",
)
output = llm(
	"Once upon a time,",
	max_tokens=512,
	echo=True
)
print(output)

About

Thanks Redmond.AI for GPU Sponsor!

Quantization for: https://huggingface.co/artificialguybr/LLAMA3.2-1B-Synthia-II-Redmond

How to use

If you are unsure how to use GGUF files, look at the TheBloke READMEs for more details, including on how to concatenate multi-part files.


🌐 Website

You can find more of my models, projects, and information on my official website:

🚀 Prompt Hub

Need high-quality prompts for image models and LLMs? Explore findgoodprompt.com.

💖 Support My Work

If you find this model useful, please consider supporting my work. It helps me cover server costs and dedicate more time to new open-source projects.

Downloads last month
126
GGUF
Model size
1B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for artificialguybr/LLAMA3.2-1B-Synthia-II-Redmond-gguf

Dataset used to train artificialguybr/LLAMA3.2-1B-Synthia-II-Redmond-gguf