GGUF
How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf nkasmanoff/picorder-moondream:F16
# Run inference directly in the terminal:
llama-cli -hf nkasmanoff/picorder-moondream:F16
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf nkasmanoff/picorder-moondream:F16
# Run inference directly in the terminal:
llama-cli -hf nkasmanoff/picorder-moondream:F16
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf nkasmanoff/picorder-moondream:F16
# Run inference directly in the terminal:
./llama-cli -hf nkasmanoff/picorder-moondream:F16
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf nkasmanoff/picorder-moondream:F16
# Run inference directly in the terminal:
./build/bin/llama-cli -hf nkasmanoff/picorder-moondream:F16
Use Docker
docker model run hf.co/nkasmanoff/picorder-moondream:F16
Quick Links

Fine-tune of https://huggingface.co/vikhyatk/moondream2 on a subset of the Cauldron, designed to improve visual question answering and reading of text off of natural images.

This is a WIP, and the model versions available may change with commits. Still figuring out what the best subset is to make this as useful as possible for real world scenarios.

This small model is able to be hosted on smaller hardware, such as a Raspberry Pi.

More context on the model training can be found on the WandB logs and Git repo.

https://wandb.ai/noahpunintended/moondream-ft-picorder?nw=nwusernoahpunintended

https://github.com/nkasmanoff/pi-card

Downloads last month
112
GGUF
Model size
1B params
Architecture
phi2
Hardware compatibility
Log In to add your hardware

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train nkasmanoff/picorder-moondream