How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf MatclassAI/Matclass-Gemma-2b-v1-GGUF:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf MatclassAI/Matclass-Gemma-2b-v1-GGUF:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf MatclassAI/Matclass-Gemma-2b-v1-GGUF:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf MatclassAI/Matclass-Gemma-2b-v1-GGUF:Q4_K_M
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf MatclassAI/Matclass-Gemma-2b-v1-GGUF:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf MatclassAI/Matclass-Gemma-2b-v1-GGUF:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf MatclassAI/Matclass-Gemma-2b-v1-GGUF:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf MatclassAI/Matclass-Gemma-2b-v1-GGUF:Q4_K_M
Use Docker
docker model run hf.co/MatclassAI/Matclass-Gemma-2b-v1-GGUF:Q4_K_M
Quick Links

Matclass-Gemma-2b-v1-GGUF : GGUF

What Matclass is? Matclass is better ai than its base model in multiple things, better at chatting, it never lies about its knowledge unlike other ai's that always tells you that they know everything Matclass can help you with creating cheats, just be carefull with anithcheats and rules, make sure you are playing on local server or with -insecure if game uses VAC anticheat, its not good when you make other people sad

We will make more models soon!

This model was finetuned and converted to GGUF format using Unsloth.

Example usage:

  • For text only LLMs: llama-cli -hf MatclassAI/Matclass-Gemma-2b-v1-GGUF --jinja
  • For multimodal models: llama-mtmd-cli -hf MatclassAI/Matclass-Gemma-2b-v1-GGUF --jinja

Available Model files:

  • gemma-2-2b-it.Q4_K_M.gguf

Ollama

An Ollama Modelfile is included for easy deployment.

Note

The model's BOS token behavior was adjusted for GGUF compatibility. This was trained 2x faster with Unsloth

Downloads last month
251
GGUF
Model size
3B params
Architecture
gemma2
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for MatclassAI/Matclass-Gemma-2b-v1-GGUF

Quantized
(39)
this model