How to use from
Hermes Agent
Start the llama.cpp server
# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf YouziMa/functiongemma_lora:Q8_0
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default YouziMa/functiongemma_lora:Q8_0
Run Hermes
hermes
Quick Links

functiongemma_lora : GGUF

This model was finetuned and converted to GGUF format using Unsloth.

Example usage:

  • For text only LLMs: llama-cli -hf functiongemma_lora --jinja
  • For multimodal models: llama-mtmd-cli -hf functiongemma_lora --jinja

Available Model files:

  • functiongemma-270m-it.Q8_0.gguf

Note

The model's BOS token behavior was adjusted for GGUF compatibility. This was trained 2x faster with Unsloth

Downloads last month
130
Safetensors
Model size
0.2B params
Tensor type
F32
·
F16
·
U8
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support