ZygAI-OSS-138M (GGUF) πŸ‡±πŸ‡Ή

This is the GGUF format of the ZygAI-OSS-138M model, a 138.6 million parameter Lithuanian Large Language Model built entirely from scratch and fine-tuned to act as a conversational assistant.

πŸš€ How to use this model locally

This .gguf file is designed to be highly compatible with local inference engines. To achieve this, our custom Transformer architecture was successfully mapped to the standard gpt2 GGUF structure.

You can run it on Windows, Mac, or Linux using:

1. LM Studio

  • Download the zygai-oss-138m-sft-f16.gguf file.
  • Drag and drop it into LM Studio.
  • In the right-hand panel, set the System Prompt/Prefix to: Question: and the Suffix to \nAnswer:

2. Ollama

  • Create a file named Modelfile on your computer with the following content:
    FROM ./zygai-oss-138m-sft-f16.gguf
    TEMPLATE "Question: {{ .Prompt }}\nAnswer:"
    
  • Open your terminal and build the model:
    ollama create zygai-lt -f Modelfile
    
  • Chat with it!
    ollama run zygai-lt
    

3. llama.cpp

Run the model directly via the command line:

./main -m zygai-oss-138m-sft-f16.gguf -p "Question: Kas yra Vilnius?\nAnswer:"

πŸ“ Prompt Format

Because this is the SFT (Supervised Fine-Tuned) version, it responds best to the exact format it was trained on. It will automatically output an invisible <EOS> token and stop generating once the answer is complete.

Downloads last month
254
GGUF
Model size
0.1B params
Architecture
gpt2
Hardware compatibility
Log In to add your hardware

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support