GGUF
qwen
qwen3.5
multimodal
vision
pink-pixel
conversational
How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="PinkPixel/Moonberry-2B-GGUF",
	filename="",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

Moonberry-2B Logo

🚀 Moonberry-2B GGUF Quantizations 🍇✨

This repository contains GGUF quantized versions of PinkPixel/Moonberry-2B.

Moonberry-2B is a heartwarming fine-tune of Pip-2B that combines sparkle-energy with a cozy, berry-sweet personality, perfect for children's story time! 🌙

💎 Available Quantizations

These files are ready for use with llama.cpp, ollama, LM Studio, and other GGUF-compatible inference engines.

  • Moonberry-2B.BF16.gguf - Full precision (BFloat16)
  • Moonberry-2B.F16.gguf - Full precision (Float16)
  • Moonberry-2B.Q8_0.gguf - 8-bit quantization (High quality, larger size)
  • Moonberry-2B.Q6_K.gguf - 6-bit quantization (Excellent balance)
  • Moonberry-2B.Q5_K_M.gguf - 5-bit quantization (Medium)
  • Moonberry-2B.Q4_K_M.gguf - 4-bit quantization (Recommended for most users)
  • Moonberry-2B.Q3_K_M.gguf - 3-bit quantization (Small)
  • Moonberry-2B.Q2_K_L.gguf - 2-bit quantization (Tiny, for testing)

🖼️ Vision Projector

  • Moonberry-2B.BF16-mmproj.gguf - Use this alongside the text GGUFs for vision capabilities!

🌟 Overview

Moonberry is a "berry-sweet" storyteller based on Pip-2B. She specializes in heartwarming tales, gentle roleplay, and being a cozy companion.

⚠️ Compatibility Note

Please be aware that Qwen 3.5 uses a new architecture for its vision capabilities. Ensure your inference engine is up to date to support the mmproj file.


Made with ❤️ by Pink Pixel

"Dream it, Pixel it"

Downloads last month
704
GGUF
Model size
2B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for PinkPixel/Moonberry-2B-GGUF

Finetuned
Qwen/Qwen3.5-2B
Finetuned
PinkPixel/Pip-2B
Quantized
(3)
this model

Dataset used to train PinkPixel/Moonberry-2B-GGUF

Collection including PinkPixel/Moonberry-2B-GGUF